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Preface 


This book is rooted in thirty years of managing Massachusetts Institute of Technology 
(MIT) Lincoln Laboratory activities that deal in the application of adaptive antenna 
array technology to the sensing of high frequency (HF) communication signals. At 
the start, it was unclear whether such techniques would be successful in suppress¬ 
ing interference and accurately estimating directions of arrival. Because HF signals 
from distant transmitters arrive at a sensor after ionospheric refraction along roughly 
inverted-U-shaped paths, rather than by the straight line-of-sight paths experienced by 
communication signals at very high frequency (VHF) and above, there were doubts 
about the efficacy of antenna array methods. Many of these are based on the plane 
wave phenomenology associated with line-of-sight propagation. Initial efforts focused 
on demonstrating the potential of adaptive array techniques at HF, but as the expe¬ 
rience base grew, research expanded into quantitative assessments of array-processing 
effectiveness and its dependence on antenna array characteristics. 

The primary goal here is to capture the technology underlying this experience 
base, providing practicing engineers with a single, self-consistent reference on adap¬ 
tive array technology in the HF communications domain. After years of sorting 
through technical papers with widely varying terminology and notation, it seemed 
that such a reference would be useful. We hope that this primary goal has been 
achieved. Secondary goals have been to elucidate relationships among the various 
methods in current use, and to correlate achievable performance with array geometry 
and signal-processing approaches. 

The technical material presented here starts with admittedly superficial reviews of 
HF propagation phenomenology, communication signal formats, and HF receiver 
architectural approaches. This material is intended to provide a common technical and 
notational base to a varied audience. It then outlines the goals of signal sensing before 
launching into the core material with a description of the vector signal model used 
throughout. This model is then utilized in descriptions of signal detection and line- 
of-bearingf estimation techniques; the second of these leads naturally into a discussion 
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of geolocation methods. Finally, the last two chapters cover signal estimation, widely 
identified as signal “copy.” 

Readers of this monograph are assumed to have experience with linear algebra and 
be knowledgeable in the areas of ionospheric phenomenology and communication- 
signal characteristics. Although the early chapters do review these areas and are 
designed to provide an “on-ramp” for the reader, they cannot be regarded as texts 
on these subjects. More detailed background is readily available in the texts referenced 
in chapters 2 and 3. 
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Introduction 


1.1 Overview 

As most readers know, the electromagnetic spectrum is divided into segments based 
on the radio frequency (RF) of the relevant electromagnetic wave. The high-frequency 
(HF) segment of this spectrum is specified to be 3 to 30 MHz. Waves at this fre¬ 
quency are refracted as they pass through the Earths atmospheric layer known as the 
ionosphere as a result, a wave transmitted skyward by a ground-based antenna may 
not continue in a straight line but may be bent back toward the Earth, where it can 
be received by an appropriately tuned radio receiver. The details of this phenomenon 
depend on the specific radio frequency, on the launch angle of the wave, and of course 
on the ionization conditions. Favorable propagation conditions are common enough 
to have spawned a wide variety of HF communications-signaling applications, includ¬ 
ing transcontinental radio operations, commonly known as shortwave (SW) radio, 
and over-the-horizon radar (OTHR) 

But the often-favorable propagation conditions that induced these applications, and 
their associated long communication ranges, have led to a locally crowded spectrum. 
The spectrogram of figure 1.1, often called a waterfall plot , illustrates this crowding. 
It shows the power present in each narrowband frequency cell in a 2 MHz frequency 
span, during a 10 second interval, as a function of the frequency specified by the ordi¬ 
nate. Many signals continue unabated over this interval; others last only a short time. 
Noise bursts abound. 

Such spectral crowding is common in the HF band. After all, a transmitted HF 
signal that can reach a distant intended receiver will also illuminate a large area where 


1 From 80 to ~400 km in altitude. 
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Figure 1.1 Spectrogram (waterfall plot) example 

many unintended receivers are located; these receivers may be communicating with 
other transmitters, but the undisciplined nature of most HF signaling protocols leaves 
many possibilities for multiple users to employ a common frequency. Such cochannel 
interference is therefore a very common occurrence at HF, and as a result, the need 
for interference mitigation approaches is clear. Adaptive antenna technology, a power¬ 
ful interference-suppression technology based on multielement antenna arrays at the 
receiver to exploit spatial differences among cochannel signals, has recently become 
affordable for such an application. This technology is the primary focus of this work. 

Adaptive antenna arrays have been an academic study topic for decades. Elements of 
such arrays are equipped with hardware that can modify the amplitudes and phases of 
each element signal, each represented mathematically as a complex signal, by a complex 
weight prior to their summation in a beamformer. These weights are computed adap¬ 
tively to instill the beamformer with the desired characteristics. Mathematical analysis 
and computer simulations have shown such systems to have great power in suppress¬ 
ing interference and in mitigating multipath effects. Yet until recently, implementation 
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costs have denied the benefits of this technology for many applications. Today, modern 
electronics technology makes such adaptive antenna systems practicable and afford¬ 
able. As a result, these systems are now appearing in an increasing number of military 
and commercial settings. 

Initially, these applications focused on environments where line-of-sight propaga¬ 
tion phenomenology prevailed. Early examples included applications in radar and 
communications. The successes in these situations invited the exploration of more 
complex scenarios, especially in the communications field. In particular, HF commu¬ 
nications in which the long-range propagation effects and limited spectrum guarantee 
cochannel interference in a multipath-rich environment became a target for the emerg¬ 
ing adaptive antenna technology. As any user of the FIF band will attest, this domain 
cries out for help. 

This book addresses the theory and application of modern adaptive antenna arrays 
as applied to HF signal sensing and reception. Earlier books [1, 2, 3] have covered 
classical adaptive array theory in some detail and have provided the foundation for 
many arrays operating today. But a number of recent theoretical developments espe¬ 
cially applicable to the multipath-rich HF sensing/reception problem have not been 
treated in a comprehensive, self-consistent way. Furthermore, although most of these 
developments have been published in a variety of journals, notational approaches vary 
widely. Consequently, reconciliation of the recent results and comparisons of them 
with classical theory becomes tedious at best. This shortfall is the central theme here. 
Thus, the goals of this book are to: 

• Briefly describe the HF signal environment to provide background for readers 
unfamiliar with the challenges facing an HF receiver system. 

• Define an appropriate mathematical framework for describing a wide range of 
theoretical adaptive antenna array models. 

• Present the theoretical foundations for those array-processing algorithms appli¬ 
cable to the HF communications regime. 

• Provide example algorithm results based on simulations of appropriate HF signal 
scenarios. 

Thus, the audience for this book includes members of both the civil and military 
communities who are involved with processing HF signals, especially those received in 
an interference-rich environment. Although the antenna array-based techniques con¬ 
sidered here are likely to impose additional costs for the antenna and the multichannel 
processing, often the interference environment is such that these approaches are the 
only options with potential for success. One hope is that this work will stimulate 
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the use of advanced adaptive antenna array algorithms in various HF communication 
applications and will inspire the development of even more sophisticated algorithms 
for improved performance in this complex, unpredictable, and fascinating propagation 
environment. 

1.2 High-Frequency Propagation 

Almost since the beginning of radio, the term HF communications has referred to com¬ 
munications in the frequency band from 2 to 30 MHz. At these frequencies, radio 
waves transmitted by ground-based antennas can be refracted by various layers of the 
ionosphere, returning sufficient energy to Earth for reliable ground-to-ground, over- 
the-horizon signal reception. But ionospheric layers are the result of solar heating, and 
as a result, the spatial, spectral, and temporal characteristics of the signals reaching 
a receiving antenna array become intimately tied to the diurnal physics of the iono¬ 
sphere along the signal path. As a result, any treatise on HF signal processing would be 
incomplete without at least superficial discussion of ionospheric phenomenology as it 
effects HF radio signal propagation. Such an introduction is included in chapter 2. 

The atmosphere above the Earth consists of a gaseous mixture with an approx¬ 
imately exponential decreasing density, characterized by a decay constant of about 
7 km. As the sun illuminates this atmosphere, solar ultraviolet rays ionize air gas 
molecules in an absorption process. But as the suns rays penetrate deeper into the 
atmosphere, their energy decreases rapidly due to the increase in the ambient air den¬ 
sity. Overall, the concentration of free electrons resulting from this ionization is low 
near the Earth surface, increases with altitude, becomes greatest at about 400 km alti¬ 
tude, and then declines as the atmosphere fades away to the nothingness of free space. 
This overall picture is illustrated in figure 1.2. 

Also, as seen in figure 1.2, this superficially smooth model is actually modulated by 
irregularities that are often described as layers. A layer exists where the free electron 
density gradient is sufficient to refract a radio signal back toward the Earths surface. 
Although this refraction is gradual and causes a signal to traverse a curved path, the 
phenomenon can be viewed simply as a reflection. In fact, the layers are only partially 
reflective with much of the signal energy continuing through the layer. 

Quantitatively, the influence of the ionized medium on upwardly propagating radio 
signals depends on free electron density, ambient magnetic field, signal frequency, sig¬ 
nal polarization, and transmitter-receiver-layer geometry, but as just discussed, the first 
of these is driven by ultraviolet radiation from the sun and thus depends on time of day, 
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Figure 1.2 Atmospheric ion density vs. altitude 

season, altitude, latitude and sunspot number. In addition to reflections, signal energy 
absorption can occur when the free electron density is high. These various effects make 
propagation phenomenology quite variable and difficult to describe concisely. 

From the standpoint of this book, only a few basic ideas are important: 

( 

• The ionospheric reflections allow signals to be received well beyond the optical 
horizon, even at transcontinental distances. 

• Signal strength at the receive station varies a great deal with time of day, season, 
and frequency. 

• A transmitted signal can reach a specific receive station by several different length 
paths, resulting in severe fading as the received signal components combine with 
different phases at the receive system antenna. 

• Such long-distance propagation is responsible for significant levels of cochannel 
interference that must somehow be mitigated if the intended communication 
link is to operate successfully. 

These factors influence the strategies for reliable communications in the HF band 
and the signal-processing algorithms that can be used to improve them. 
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1.3 HF Band Utilization 

Because of its potential for long-range communication at relatively low cost, use of the 
HF band has flourished despite its complex propagation phenomenology. Shortwave 
broadcast, emergency messaging, ship-to-shore telegraphy, and amateur radio trans¬ 
missions are some of the most well-known applications. But a relatively unstructured 
frequency-management approach, arising from the uncertain nature of the propaga¬ 
tion medium, the uncertain maximum communication range, and the international 
regulation environment associated with the long-range nature of HF communications, 
has led to great variability in frequency occupancy across the band. There are no well- 
defined frequency slots or standardized signal bandwidths. As a result, transmitting 
users can never be sure of having a clear broadcast channel, and receivers can never be 
sure that only one signal will be present at a specific frequency. 

Originally, most HF radio traffic could be identified as voice or Morse code (collo¬ 
quially, CW, or continuous wave). But today, many HF users are transmitting data at a 
variety of rates, with a variety of signal modulations. A very recent protocol, automatic 
link establishment (ALE), has developed a significant following. This protocol uti¬ 
lizes a 2.4 kHz frequency-shift keyed (FSK) signal format with an adaptive frequency 
selection protocol, and has been used as a foundation for a low-cost computer-to- 
computer networking system. Chapter 3 covers a number of common HF signal 
formats indicating their instantaneous frequency occupancy. 

A set of spectrograms covering the entire HF band, from 2 to 30 MHz, as a function 
of time of day would show the great variability in the signal environment experienced 
at HF. The dynamic range is very wide, and weak signals are often overwhelmed by 
much stronger cochannel signals; such characteristics significantly limit the ability of 
a single-channel receiver system to receive a weak signal. Multichannel receivers, with 
their outputs processed by adaptive antenna array algorithms, have the potential for 
much-improved prosecution of such weak signals. This potential is explored herein. 

1.4 Motivations for Adaptive Antenna Array Application at HF 

Propagation phenomenology and spectral occupancy associated with HF communi¬ 
cations have been outlined in the previous sections. Although the description was 
superficial, the reader should come away with this message: the HF band is char¬ 
acterized by extreme variability and crowding. If one tunes a narrowband (2.4 kHz 
bandwidth) receiver to any specific frequency, one is likely to find several signals 
within its bandwidth. In addition, at least some of the signals in this bandwidth will 
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have originated from the same transmitter but will have been refracted by different 
ionospheric layers and thus will exhibit different Doppler offsets and time delays. The 
strength of the various signals will also manifest a large dynamic range, often exceeding 
60 dB. 

Thus, an HF receiver is faced with a challenging signal environment. Co-channel 
interference and substantial levels of multipath abound, compromising quality recep¬ 
tion. These challenges are the very ones that adaptive antenna array technology has 
been developed to address. Its success in suppressing interference in the line-of-sight 
radio communications environment for frequencies from VHF (30 to 300 MHz) 
through L-band (~1500 MHz) [1, 2, 3], in radar interference suppression at frequen¬ 
cies from UHF (300 to 600 MHz) through C-band (5000 MHz) [4], and in passive 
sonar where there is little control over the form of the signals to be detected and pro¬ 
cessed has been documented [5,6]. It is unnecessary to summarize these successes here. 
Needless to say, the evidence suggests that adaptive antennas could well play a role in 
the quality reception of HF communication signals. 

However, there is one important issue that is much more prominent in the HF 
band: signal models used in adaptive antenna array-processing technology always 
assume plane wave propagation, yet HF skywave signals have been refracted by the 
ionosphere and reach the receiver system by much more complex propagation phe¬ 
nomenology than in line-of-sight communications. At the very least, this aspect of 
HF communications signaling injects an iota of uncertainty about the applicability 
of conventional adaptive antenna array technology in the HF environment. A first 
step necessarily involves assessment of the line-of-sight, plane wave signal model to 
HF. Once this basic question has been resolved and the potential for performance 
enhancement is established, it will be appropriate to quantitatively assess the extent of 
adaptive antenna performance benefits. 

1.5 Overall Plan of the Book 

With the stage set for presenting adaptive antenna theory and its application to HF 
communications in detail, a brief outline of the succeeding chapters will be described. 
First, a more detailed description of the HF propagation environment will be pre¬ 
sented in chapter 2. This chapter will motivate the multipath models used in later 
discussions of adaptive array algorithms and will provide an overall characterization of 
the temporal, spectral, and spatial effects of the ionospheric refraction on the transmit¬ 
ted signals as they propagate from their sources to a receiving system equipped with 
an array antenna. 
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Next, chapter 3 will provide a description of the more important modulation 
approaches and signaling protocols used at HF. It will include a discussion of how HF 
propagation affects a particular modulation scheme and how these effects influence 
the choice of signal protocol parameters. 

After these descriptions of the propagation environment and its component sig¬ 
nals, an overview of the important receiver system architectural features needed in an 
HF receiver system is presented in chapter 4. This chapter will cover coherence and 
dynamic range requirements for a suitable receiver module to be included in an HF 
receiving system. Also, digitization of the receiver outputs will be discussed, leading 
to a specification for an appropriate analog-to-digital converter (ADC). A brief com¬ 
parison of two commercially-available ADCs will be given to illustrate the evaluation 
process. No attempt will be made to suggest particular ADCs; such an endeavor will 
be outdated before this book reaches the readers hands. 

Chapter 5 will address two important topics, the signal processor architecture and 
its implementation. Throughout this book, the focus will primarily be on narrowband 
signals. In actual system practice, a wideband system may be built and operated, but 
the architecture includes an initial stage of signal processing using digital methods 
to break the signal seen at each array element into a dense collection of narrowband 
frequency subchannel signals. Channelization processing is necessary to accomplish 
this. Then, the narrowband algorithms covered in this text can be applied to individ¬ 
ual subchannels separately. Methods to accomplish the creation of multiple adjacent 
subchannels are covered here, addressing issues such as subchannel leakage and signal 
fidelity. 

Chapter 6 will provide the technical foundation for this book with a description of 
the vector model for a signal received at an adaptive array antenna. It will start with 
a summary of the necessary mathematical background, not only to fill gaps in the 
reader s knowledge but also to present the notation to be used when specific algorithms 
are described later. The basics of wave propagation will be reviewed and presented in 
the coordinate system that will be used throughout the book. A simple array example 
is included to explain the importance of aperture in resolving individual signals in a 
multisignal environment. The notation needed to address antenna arrays that manifest 
polarization diversity will be described. Finally, this chapter will describe the sum and 
difference processing associated with conventional monopulse antenna systems. 

Next, attention shifts to actual signal-processing techniques starting with the detec¬ 
tion topic. Chapter 7 addresses the important question: “Is there a signal present?” At 
HF this question has a critical extension due to the crowded nature of the HF band: 
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“Is there a new signal present?” And to the further, not unrelated question: “Has a 
previously detected signal turned off?” And to another very important question: “Does 
a new energy alarm correspond to a signal of interest?” Here again, examples of these 
algorithms applied to HF channel data are given. 

Next, chapters 8 and 9 will consider the direction-finding goal. Starting with the 
computation of relevant performance bounds in chapter 8, the topic will continue with 
a myriad of specific direction-finding algorithms in chapter 9. This chapter will include 
a discussion of array calibration and how it can contribute to achieving direction¬ 
finding results that approach the bounds described in chapter 8. Example results, using 
simulated HF channel data, are included. 

Chapter 10 addresses an important topic for consumers of the direction-finding 
results, geolocation. This term refers to methods for estimating the location of the source 
from which a signal has been detected and for which a direction has been estimated. 
Methods include those that utilize lines of bearing from a single sensing system, as well 
as those that combine such estimates from multiple sensors. 

Finally, chapters 11 and 12 will focus on signal estimation, often referred to as copy. 
This term arises from the common communications operator lingo “Do you copy me?” 
In effect, this refers to determining a set of complex antenna element weights that can 
be applied to the signals observed at the various array elements, thereby producing a 
single receiver system output that is an estimate of the transmitted signal. Chapter 11 
. focuses on methods that estimate a signal steering vector and use this estimate to com¬ 
pute copy weights. In contrast, chapter 12 exploits a priori knowledge of a specific 
signal type to compute appropriate antenna element weights. Least mean square meth¬ 
ods, the earliest approach used for signal copy, is one example of this class, but constant 
amplitude signal copy is a more important algorithm for HF signals where phase-shift 
keyed signals are frequently transmitted. Example results of these algorithms applied to 
HF channel data are given with each algorithm; also included is a discussion of weight 
update approaches that are necessary if the dynamic nature of HF signal propagation 
is to be addressed effectively. 
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HF Propagation Physics and Its Effect on Signals 


2.1 Ionospheric Medium 

This section describes the ionospheric medium and the natural influences that affect 
its characteristics. 

2.1.1 Overview 


The roughly spherical Earth is surrounded by an atmosphere composed of nitrogen, 
oxygen, carbon dioxide, and other gases; this atmosphere is densest near the Earths 
surface, but as indicated in figure 2.1, its density decreases approximately exponentially. 
The lapse constant is about —7.027 km, which is the altitude change resulting in a 
decrease in density by a factor 1/e. Thus, at an altitude of 50 km, the atmosphere is 
^(—50/7.027) _ 0.0008125 times the density at the surface. 

This “upper” atmosphere is bombarded by the suns radiation, ionizing the air and 
generating free electrons and ionized molecules of both positive and negative per¬ 
suasion. Energy at short ultraviolet (UV) wavelengths is especially effective in its 
interactions with the atmosphere. Free electrons are released from neutral atoms, 
creating positive ions, but these electrons can in turn bombard neutral atoms and 
be captured to form negative ions. As a result, this upper atmosphere is an ionized 
medium, or plasma, that is known as the ionosphere. An impinging electromagnetic 
wave may propagate through this plasma, or not, depending on various factors detailed 
later in this chapter. This section concentrates on ionospheric characteristics; the 
next section outlines the basic interaction between such a propagating wave and the 
ionosphere. 
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Altitude (m) x 10 4 


Figure 2.1 Standard atmospheric density vs. altitude compared to exponential fit 

Of course, at any specific location in the ionosphere, solar flux is not constant. Even 
if the suns radiation did not vary, which it does, Earth rotation imposes a diurnal cycle, 
and the Earths annual orbit around the sun, combined with the tilt of the Earth axis 
with respect to the ecliptic plane, superimposes a second cyclic component on the 
impinging radiation. But there are also variations in the solar emission level due to 
thermal activity on the sun, including solar flares and coronal mass ejections (CMEs). 
These solar events also have cyclic features. Solar flares manifest a period synchronous 
with the eleven-year sunspot cycle; the effects of CMEs show variations that follow 
the twenty-eight-day rotation period of the solar sphere. In addition to these multiple 
cyclic variations in solar flux at a specific location, there are variations induced by 
magnetic storms that have unstructured temporal variations. 

But in addition to these interactions of the sun with the Earths atmosphere, there 
is another source of ionization, that produced by meteorites captured by Earth gravity. 
These penetrate the atmosphere, give up their energy, and ionize gas molecules encoun¬ 
tered along their path. This contributor to the ionospheric characteristics is especially 
important at higher latitudes in the winter when solar insolation is at its ebb. 

Thus, ionospheric physics is fairly complex, resulting in a complicated ionospheric 
geography. The composite effects of the various physical processes produce an iono¬ 
sphere that varies with altitude, latitude, sunspot number, time-of-day, and season. 
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Fortunately, scientists have observed ionospheric conditions for more than a century 
and have been able to develop models for those features important to HF commu¬ 
nications. These models have been incorporated into teadily available and effective 
computer codes. The most frequently used of these models will be described briefly 
later in this chapter. " 

2.1.2 Solar Wind 

Radiation and particle flow from the sun make up the so-called solar wind, which 
interacts with the Earth. Solar wind is a term given to the ionized gas, or plasma, that 
is continually ejected from the sun as a result of its inherent thermonuclear activity. 
The gross structure of the solar wind phenomenon, as it reaches Earth, is illustrated 
in figure 2.2. Because the plasma is fully ionized, it can be regarded as a conducting 
medium that, as it flows outward from the sun, encounters the Earths magnetic field. 
The plasma cannot easily penetrate the Earths magnetic field, which represents an 
obstruction to the flow. Much as a submerged boulder is an obstruction to water flow 
in a mountain stream, the Earths magnetic field interacts with the plasma, distorting 
the magnetic field from the simple magnetic dipole structure that would exist if there 
were no solar wind. The result is a compressed magnetic field extending approximately 
ten Earth radii on the sunward side and an elongated tail extending approximately 
hundred Earth radii in the Earths shadow. The envelope of the resultant distorted 



Figure 2.2 Interaction of the solar wind with Earth and its atmosphere [1] 
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magnetic field is known as the magnetopause. As the solar wind penetrates to lower 
altitudes inside the magnetopause, these interactions change due to the integrated 
effects on the radiation spectrum and the atmospheric species mix that remains after 
interactions at higher altitudes. Further, geometry plays a role because the radiation 
flux is reduced as the zenith angle increases, as shown in figure 2.3. This latter effect is 
the same one producing more sunburn in summer when the sun is directly overhead. 
The consequence is the fairly complex magnetosphere, the asymmetrical region sur¬ 
rounding the Earth extending from about 100 to several thousand kilometers altitude, 
as figure 2.2 shows. 

The level of ionization must exceed ~10 8 ions/m 3 for it to have an appreciable 
effect on a propagating electromagnetic wave. During the day, short UV radiation 
is strong enough to generate such ionization levels down to 65 km altitude. But at 
night, when solar radiation is not providing energy to form new ions, recombination 
processes dominate, and at the lower altitudes, the atmosphere returns to its neutral 
state. At higher altitudes, however, some solar UV-induced ions remain uncombined, 
and, together with the meteorite-induced ions provide sufficient ion densities to leave 
a sensible ionosphere above 100 km altitude. 
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2.1.3 Altitude Variations 

The detailed interaction of the solar radiation with the atmosphere is complex and 
beyond the scope of this work. Nevertheless, it can be summarized by noting that the 
absorption cross section of the atmospheric atoms depends on the species and on radi¬ 
ation wavelength. Thus, the radiative energy spectrum evolves as it penetrates deeper 
into the atmosphere. As a result, the ion and free-electron densities become strong 
functions of altitude. Figure 2.4 provides four example electron-density profiles corre¬ 
sponding to noon and midnight for both a low and a high sunspot number. Generally, 
these profiles manifest a low electron density near the Earths surface, and a gradual 
increase with altitude. Peak density occurs at ~300+ km altitude, at summertime 
noon when the sunspot number is high. 

In addition to neutral winds, there are other weather-like dynamic processes driv¬ 
ing ionospheric dynamics well into the upper atmosphere, for example, plasma drifts 
driven by electric fields, which in turn are generated by a variety of neutral wind, 
magnetic field, and plasma interactions. As altitude increases, the radiative-chemistry 
and plasma-neutral wind effects dominate over the neutral atmospheric processes of 
the troposphere and mesosphere (e.g., turbulent mixing). These interactions produce 
an altitude-dependent, irregular electron-density profile that is often characterized 
using the term layers , even though the altitude variation is more subtle than that term 
implies. 



Figure 2.4 Election density vs. altitude: various times of day and sunspot number [1] 
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Although boundaries between these layers are not distinct, there are five terms used 
to describe them. The lowest of these, the D layer, extends from 65 to 90 km and 
primarily induces signal attenuation during the daytime hours. Above it is the E layer, 
from 90 to 140 km; higher yet are two layers, Fi and F 2 , with the first representing a 
daytime layer around 170 km and the second a nighttime layer above 250 km extending 
to as high as 500 km. As the diurnal cycle transitions from night to day in the morning 
and then back again in the evening, both Fi and F 2 layers can be present. Finally, there 
is sporadic E, an erratic phenomenon with a fairly narrow altitude extent at about 

105 km. 

Solar flares are the result of plasma instabilities in the suns outer layers (convection 
zone, photosphere, chromosphere, and corona) driven by a magneto-dynamic process. 
If a solar flare occurs within view of the Earth, a few minutes later brief bursts of 
energetic protons reach the Earth. If the flare produces energetic particles (a solar 
particle event [SPE]) and field lines emanating from the flare connect to Earth, then 
these particles arrive at Earth on the order of tens of minutes later. Furthermore, if 
lower-energy coronal mass is ejected during the event (CME), it will reach Earth in 
a period of one to three days. Energetic particles increase ionization at low altitudes, 
in particular in the D layer. CMEs reaching the Earths magnetosphere drive currents 
in the ionosphere that cause variations in electron density, predominantly at higher 
altitudes. 

Both SPEs and CMEs contribute to the ionizing processes already occurring in the 
atmosphere as a result of the more normal short-UV solar radiation. Although these 
solar flares are short lived, they precipitate a dramatic increase in ionospheric plasma 
density, especially at D layer altitudes. Effects on E and F layers are less significant 
because at their altitudes the ion density is low enough to make collisions rare; as a 
result, there is insignificant nondeviative absorption, the process by which a propa¬ 
gating electromagnetic wave is attenuated without deviation from a straight-line path. 
More discussion of this phenomenon is covered in the next section. Since solar flares 
are more common when the sunspot number is high, these sudden increases in ioniza¬ 
tion, though random, follow the eleven-year sunspot cycle in their overall frequency 
of occurrence. 

2.1.4 Geographic Variation 

Another unpredictable ionospheric feature is identified by the term traveling iono¬ 
spheric disturbances , or TIDs. TIDs are a common ionospheric feature characterized 
by wavelike disturbances in the electron-density profile. They are driven by both 
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gravity waves propagating up from the neutral atmosphere and electrodynamic effects 
resulting from geomagnetic disturbances. 

These general characteristics of the ionosphere are modified in ways that depend 
on geographic latitude. Four specific latitude bands manifest identifiable character¬ 
istics: equatorial, midlatitude, auroral, and polar. A brief summary of the principal 
differences among them follows. 

Equatorial ionosphere. As will become apparent when the auroral and polar iono- 
sph eres are discussed, the equatorial ionosphere manifests greater plasma density 
irregularities than the midlatitude ionosphere. The underlying causes for these fea¬ 
tures lie with the orientation of the Earths magnetic field, often modeled as a 
magnetic dipole tilted approximately 15° to the Earths rotation axis. At equatorial 
latitudes, the magnetic field lines are almost parallel to the Earths surface. So-called 
equatorial spread F (ESF), also known as equatorial clutter, involves low-electron- 
density globules that rise from the bottom to the top of the F layer shortly after 
sunset when the solar flux abruptly ends. These bubbles can be as small as a few 
centimeters in diameter or as large as 1000 km across. Because the loss of sunlight 
is their root cause, these bubbles move away from the sun, that is, in an easterly 
direction. 

Midlatitude ionosphere. Many factors already mentioned are important contribu¬ 
tors to those ionospheric features affecting HF signal propagation: the day/night 
terminator, spread F, solar flares, magnetic storms, TIDs, and sporadic E. At mid¬ 
latitudes, sporadic E can persist for many hours. Ionic motion induced by neutral 
air molecule wind shears in the lower atmosphere, interacting with the Earths mag¬ 
netic field, causes the ions to form thin layers, already identified as sporadic E, at 
105 km altitude. The density of such layers can exceed that of the F layer above. 

Magnetic storms also have important effects at midlatitudes by influencing the 
shape and peak density of the F layer. Because the F layer undergoes significant 
changes at sunrise and sunset, the quantitative significance of these effects depends 
on the time of day. 

In addition, midlatitude spread F occurs occasionally, only at night, but its 
cause is not well understood. As suggested by its name, spread F is manifested by 
unusually extended F layers, with associated broadening of temporal and Doppler 
distribution effects on HF signals. 

Auroral ionosphere. The auroral belt is the region at high latitudes where particle 
precipitation is greatest, with particles originating in the Earths outer radiation 
belt, the magnetosphere, and the interplanetary environment. During the day, 
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sunlight produces a dense-enough plasma to mask other contributions to ioniza¬ 
tion; in summer, this situation also prevails at night. But at night during the other 
seasons, sunlight is insufficient to influence the formation of ions; then particle pre¬ 
cipitation, as well as diffusion from sunlit portions of the upper latitude atmosphere, 
becomes the primary cause of ionization. 

These effects can be summarized by noting that electrons streaming earthward 
follow paths prescribed by geomagnetic field lines inducing ionization at both E and 
F layer altitudes. In addition, the F layer is filled with plasma “blobs 5 diffusing from 
the midnight sector of the polar cap. Both the solar wind plasma and the geomag¬ 
netosphere influence the process, which strengthens during magnetic storms. The 
result of these multiple processes is an auroral ionosphere that is temporally and spa¬ 
tially inhomogeneous and unstable. Its effects on HF signals are likewise extremely 
variable and unpredictable. 

Polar ionosphere. The effects described for the auroral ionosphere are even more evi¬ 
dent in the polar region. Rotation of the neutral atmosphere along with the Earth 
distorts ion patches, which form on the noontime side of the polar cap, and convect 
toward the midnight side; the results are elongated areas, or “troughs,” with irreg¬ 
ular ion density (“blobs”). Such structures are common at F layer altitudes. They 
are characterized by a steep electron gradient at the boundary with the auroral oval, 
causing substantial HF wavefront scattering based on the mechanisms described in 
the next section. 

2.2 Wave Propagation in the Ionosphere 

Now that the properties of the ionosphere have been described, it is appropriate to 
consider how a radio wave propagates in such a medium. This section begins with 
wave propagation basics, continues with descriptions of ray tracing ,, and introduces the 
notion of transmission curves . The effects of a disturbed ionosphere on a propagating 
wave are also described. It concludes with summary descriptions of several popular 
software packages for HF communication link prediction. 

\ 

2.2.1 Physical Phenomenology 

The wave propagation basics addressed in this section assume that the wave is traveling 
in a region devoid of any background magnetic field such as the Earths. This is, of 
course, an oversimplification because such magnetic fields complicate the picture, but 
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the present discussion is useful as background. The deviations from the simple model 
are discussed further in section 2.2.9. 

In free space, a plane radio wave is identified by its Poynting vector, P, aligned with 
the direction of propagation. Electric and magnetic field vectors E and H, respectively, 
lie in a plane perpendicular to the Poynting vector, orthogonal to each other. At any 
instant, the magnitude of E is rj times the magnitude of H, where r] is the characteristic 
wave impedance. Recall that 77 = A //x/e, where ji is the magnetic permeability, and 6 
is the dielectric permittivity of the medium in which the wave is propagating. These 
relationships can be summarized by noting that the electric and magnetic vectors are 
related to the Poynting vector by 

P = ExH, (2.1) 

where x represents a cross-product operation. 

If the wave is linearly polarized, the direction of the electric field remains parallel 
to itself as the wave propagates. More generally, the wave has an elliptical polarization 
that can be represented by the sum of two linearly polarized waves with orthogonal 
electric field vectors. The relative magnitude and phase of the corresponding two elec¬ 
tric field vectors define the polarization characteristic of the composite wave. If the two 
waves have equal magnitudes and differ in time phase by 7t/2, the result is a circularly 
polarized wave. 

Maxwells equations provide the foundation for the interactions between a propa¬ 
gating electromagnetic wave and the ionospheric plasma through which it is propagat¬ 
ing. Consider such a wave, oscillating at a radian frequency co launched vertically from 
a point on the Earths surface. Initially, the wave propagates in a neutral atmosphere 
and travels with the speed of light, c — \/^/e 0 fji 0 } and a phase constant f} 0 = co/c, but 
as it moves upward, thejyave encounters an increasing electron density as described in 
the previous section (see figure 2.4). This plasma-like environment affects the dielectric 
constant k {k — e /e 0 ) of the medium through which the wave is propagating, thereby 
influencing the propagation velocity and consequently, the phase constant. 

Consider a specific volume in the region in which the wave is propagating. The E 
field component of the propagating sinusoidal wave exerts a sinusoidal force on the free 
electrons, F = — eE, causing them to vibrate with the same frequency as the wave. As 
with any body, the electrons respond to a force with an acceleration a proportional to 
the force, with the proportionality constant defined to be the mass of the body, in this 
case the mass of the electron m e . These vibrating electrons are effectively a sinusoidal 


1 Here the subscript o denotes the free-space values of the subscripted parameters. 
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current in the plasma. This current, in a representative cubic meter, is computable from 
the total charge in the volume, the electron density N times the charge of one elec¬ 
tron e, times the velocity v of the charge, i = — New. But, since a change in velocity is 

an acceleration, a= we have ^ = —Ne^. This acceleration corresponds to a force 


d\ 

F — —eE —me¬ 
at 


m e di 
Ne dt 


( 2 . 2 ) 


Because the electromotive force is proportional to the derivative of the current, the 
current i is inductive. 

In addition to this inductive current, the volume exhibits a capacitive characteristic 

JTJ 

in accordance with i = e 0 ^ . If the E field of the wave is represented by E = E m sin cot, 
the total current becomes 

i = co (e 0 — Ne jm e oo)E m cos cot . (2.3) 


If the medium had no free electrons, the specific volume under consideration would 
only manifest the capacitive characteristic identified above; with the addition of the 
inductive component, the effective capacitance is reduced in proportion to the free- 
electron density in accordance with equation (2.3). Thus, the bracketed quantity is the 
effective dielectric permittivity for the plasma with electron density N. 

Specifically, 


Ne 2 

£ — € 0 V 

m e co L 


(2.4) 


In this expression, N is the charge density, that is, the number of electrons, each of 
charge e , per cubic meter; m e is the mass of an electron; and co is the radian frequency 
of the wave. 


2.2.2 Ionospheric Refraction 


This modification to the dielectric permittivity affects a plane wave propagating into 
the ionosphere from below. Figure 2.5 illustrates the key ideas that apply when this 
wave experiences an abrupt change in the medium through which it is propagating. In 
this situation, such an abrupt change is an abstraction because the ionosphere is not 
sharply bounded, but the important ionospheric refraction issues can be understood by 
examining such a model. As presented with detailed analysis in electromagnetic theory 
texts [7, 8], the effects differ quantitatively depending on the polarization of the wave 
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(a) Polarization parallel to interface 



(b) Polarization in plane of incidence 
Figure 2.5 Polarization-dependent refraction 



22 Chapter 2 


crossing the boundary. The figure separates the two extreme cases: (a) the electric field is 
parallel to the boundary and (b) the electric field lies in the plane of incidence. Without 
repeating the details here, this analysis shows that there are reflected and refracted 
waves, and the angles of reflection and refraction are the same for either polarization 
of the impinging wave. However, the strengths of the reflected and refracted waves do 
depend on impinging wave polarization. 

It is important to understand the geometry associated with the refracted wave 
because this is the wave that propagates into the ionosphere and, in the cases of most 
interest here, continues to refract until it bends downward and returns to the Earths 
surface. 

A simplified model for ray refraction at an ionospheric boundary can be based on 
Snells law. Referring to figure 2.6, consider such a wave propagating upward in a 
neutral atmosphere and reaching an ionospheric layer characterized by an index of 
refraction n\\ its angle of incidence on this layer is 9 0 . Upon reaching the boundary, 
the tangential E and H vectors must be continuous, implying that the phase velocity 
along the boundary must be the same on both sides. If v 0 is the velocity with which 
a particular phase front moves along the propagation direction below the boundary, 
the phase velocity, as measured along the boundary just below it, is v 0 /sm9 0 \ if v\ is 
its velocity along the propagation direction above the boundary after refraction, the 
phase velocity as measured along the boundary just above it is i>i/sm6\. But, as just 
stated, these quantities must be equal, leading to Snell’s law : 

• n • n ' l^ol^o . a n ° . a 

sin u\ — — sin U 0 — /-sin 9 0 — — sin 9 0 , 

Vo V n\ 


where n 0 and n\ are the indices of refraction in the two media. 

Because the index of refraction in the ionized layer, n\, is less than its counterpart 
in the neutral atmosphere where n 0 = 1, the angle of refraction 9\ is greater than the 
angle of incidence 9 0 . Thus, the ray bends toward the boundary, as shown in figure 2.6. 
Of course, the ionosphere is more complex than a single layer with constant index of 
refraction; such a real ionosphere can be modeled as a series of thin layers with varying 
indices. 


2.2.3 Ionospheric Phase Velocity 

In addition, the continuity conditions along the boundary imply that planes of con¬ 
stant phase are continuous across the boundary and must remain continuous as the 
waves propagate. As a result, the horizontal distance, between two points a wavelength 
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Figure 2.6 Snell’s law refraction geometry 


apart along their velocity vectors must be the same on both sides of the boundary. This 
must be true even though the wave changes direction due to the change in refractive 
index as the ionospheric layer is entered. 

Along the impinging ray path in the neutral atmosphere, the distance between wave 
peaks is X 0 ; along the boundary, this distance increases to X 0 /sin 6 0 due to the geomet¬ 
ric relationship between the ray and the boundary. After refraction, the angle between 
the ray and the boundary becomes smaller in accordance with Snells law, as discussed 
earlier. As a result, the horizontal distance between wave peaks would decrease in 
accordance with the geometry unless the phase velocity were higher to make up for 
the decrease. Such a phase velocity increase is necessary to maintain phase continuity 
across the boundary. 

Since regions in which ionospheric refraction is of interest have no significant 
magnetic properties, it is appropriate to assume that /i\ = [i 0 . Thus, the ratio of 
dielectric permittivities becomes the key parameter. Instead of propagating with 
a phase velocity c as in free space, the wave propagates with a higher phase 
velocity, 




Ne 2 

m e o) 2 e 0 



(2.5) 


Although the ions in the medium are also affected by the passing wave, their mass 
is much greater than that of the electrons; as a result their response to the wave is 
negligible.^ 
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If the wave frequency / is expressed in hertz and appropriate values are used for all 
constants, this becomes 


The result is that the phase velocity increases as the electron density increases until 
7V=/ 2 /81. At this point, the upward-propagating wave is reflected and returns to 
Earth. 

An alternative interpretation of these relationships recognizes the factor in paren¬ 
theses as the square of the index of refraction, n. Thus, 



where/, = 9 is the so-called plasma frequency , also known as the criticalfrequency. 

Some authors present ionospheric electron-density data as plots of plasma frequency 
versus altitude; this approach will not be used here. 

2.2.4 Ionospheric Group Velocity 

As indicated previously, a wave without modulation does not transfer information 
because all segments of the wave are indistinguishable. However, when a wave is mod¬ 
ulated, it is possible to identify a specific portion of that wave, to observe how long 
it takes for that specific wave segment to travel some distance, and to compute its 
velocity. This velocity is known as the group velocity. 

A simple model that illustrates this concept can be based on a wave with two spectral 
components of equal magnitude but slightly different frequencies. Assume the waves 
have equal magnitudes E 0 , frequencies co ± A<z>, and phase constants /3 0 ± A/3. It is 
straightforward to show that the combined wave can be represented by the expression 

2 E 0 cos ( co 0 t — frx) cos (A cot — A/3 z). (2.8) 

This can be viewed as a rapidly varying carrier wave, at frequency co 0 , with a slow 
modulation at frequency A co. Since the locus of constant phase points satisfies 
co 0 t ~ Po x = constant, one can recall the phase velocity , v, where 

dx co 0 
dt l$ 0 



v 


(2.9) 
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In addition, the modulation phase satisfies A cot — Aj3 0 x = constant, leading to the 
group velocity , u 

dx A oo 
dt A/3 


( 2 . 10 ) 


In a plasma such as the ionosphere, the dielectric permittivity 6 7 ^ e 0 , but is instead 
€ ° n% = 

The propagation constant /3 —2jt/X. As noted earlier, the ionospheric change to 
permittivity modifies the distance traveled by a wave in the time corresponding to one 
sinusoidal period, defined as one wavelength. Since the wavelength is based on carrier 
variation, the relevant wave velocity is the phase velocity, which was shown to be equal 
to cIn. As noted also, since n is less than unity in the ionosphere, the phase velocity is 
greater than c, and the wave travels a greater distance in one temporal cycle. Thus, the 
wavelength is increased in the ionosphere by a factor of n. 

But the situation is different for the modulation where the group velocity is the 
relevant measure of speed. Equation ( 2 . 10 ) indicates that this quantity is computed 
as the derivative of the radian frequency 00 with respect to the phase constant /3. By 
definition, 

P = 2n/X = 2 rtf/Xf = co/kf = co/nc , (2.11) 


^ as detailed previously. Here n is the index of refraction. 


or 

Therefore, 


Note also that the product of group and phase velocities is 

uv— {nc){c/n) = c 2 . (2.14) 

In summary, a wave propagating in the ionosphere travels with a phase velocity higher 
than the speed of light by the factor while the modulation travels with a group 
velocity lower than the speed of light by the factor n. 


co — nc(3. 


A co dco 

u — -= — = nc. 


A/3 d/3 


( 2 . 12 ) 


(2.13) 


2.2.5 Vertical Sounding 


As noted,-4f the electron density increases to a point where N =f 2 /8l, the wave 
no longer penetrates the ionized medium but instead is reflected back toward Earth. 
This effect is the basis for ionogram generation: a ground-based transmitter launches 
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a series of pulses in a vertical direction, each at its own radio frequency but with 
successive pulses at linearly increasing frequencies; a receiver measures the delay to 
a returned pulse, if there is one. At frequencies where Af =/’ 2 /81 a pulse is returned; 
then the height of the layer meeting the reflection condition is estimated using the 
free-space wave velocity and delay. If a vertically propagating wave has a frequency 
below fp , it will be reflected, regardless of the angle of incidence. As will be seen 
shortly, if the wave frequency is above fp , the angle of incidence must be small 
enough so that its sine equals the index of refraction for it to be reflected back toward 
Earth. 

The process of estimating the height of a layer from measurements leads to the 
term virtual height . If the delay between a vertically transmitted pulse and its received 
reflection is measured as r, the height of the reflecting layer, the virtual height, is 
calculated to be h v = cr/2. But this is only an estimate of the true layer height. Because 
the group velocity of the pulse during the portion of its route through the ionosphere 
is lower than the speed of light, the virtual height will be greater than the actual height. 
Usually, the difference is small, but it could be as large as 100 km. 

Consider a situation where sin6 0 = In this case, sin@i =1, and the angle of 
refraction is 7r/2 for the frequency at which this condition holds; this frequency 
is known as the classical maximum usable frequency, or MUF? The incident wave 
is refracted along the layer boundary. The angle of incidence corresponding to this 
condition is 

or 

cos 2 6 0 = (fp/fMuf) 2 > (2.16) 

resulting in the condition 

fMUF —fp sec 9 0 . (2.17) 

Thus, the plasma frequency is a fundamental parameter controlling the conditions 
under which an incident HF wave will be reflected. Snells law is applicable as long 
as spatial changes in the index of refraction are small compared with the wavelength 
involved. If this condition does not hold, there will be multiple reflections and refrac¬ 
tive direction changes, making the ray path description fairly complex. Nevertheless, 
in the limit of an abrupt index of refraction boundary, there will be a true reflection, 
as outlined earlier. It is worth noting that in the case of sporadic E, the layer is thin 

2 The term operational MUF is also used; this frequency considers factors associated with the transmitter and 

receiver, such as antenna type and transmit power. 
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and fairly static, allowing the reflective interactions associated with abrupt index of 
refraction transitions. 

If the wave is propagating vertically toward the ionosphere, 6 0 — 0; then, the clas¬ 
sical MUF will equal the plasma frequency. If the propagation geometry involves a 
greater angle of incidence, the MUF increases. As an example, a typical F 2 layer elec¬ 
tron density is 10 12 electrons per cubic meter. Thus, the plasma frequency (or vertical 
MUF) is 9 MHz. Under these conditions, a vertical wave would reflect at this fre¬ 
quency; if a communicator hopes to communicate with an elevation angle of 30°, the 
angle of incidence will be 60°, with a secant value of 2.0. Thus the highest frequency 
that will support communication with this geometry is 18 MHz. Typically, a commu¬ 
nicator will use a frequency that is 90 percent of the classical MUF for the intended 
communication geometry. 

2.2.6 Ray Tracing 

Now that the physics governing interactions between the ionosphere and a prop¬ 
agating wave have been described, a more pictorial view of their paths can be 
presented. Assume that an antenna with an upwardly hemispherical pattern transmits 
a signal, with the goal of reaching a potential receiver some hundreds of kilometers 
away. Consider the family of rays emanating from this transmit antenna, each with 
a different elevation, or takeoff/ angle, in the plane containing the transmitter, the 
receiver, and the Earths center. Figure 2.7 illustrates the geometry for one of these rays, 




Ionosphere 



Figure 2.7 Transmitter-receiver geometry for ionospheric communication 
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Sample ray traces for given ionosphere + magnetosphere 
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Figure 2.8 Typical ray paths and their interactions with the ionosphere 


but each ray will reach the ionosphere with a different angle of incidence. Figure 2.8 
illustrates the paths taken by several of these rays. Although highly simplified and 
ignoring magnetic field effects, this ray picture is useful in providing conceptual 
insights into the morphology of ionospheric communication. 

Consider first the interface between a neutral atmosphere and a single ionospheric 
layer with a uniform index of refraction, n . If its wave frequency is below the plasma 
frequency, the index of refraction will be imaginary, and no refractive wave will be pos¬ 
sible; in that case, a ray will be reflected regardless of the angle of incidence. However, 
if the wave frequency is higher than the plasma frequency, refraction will occur, but 
the resulting ray path will depend on the angle of incidence, 6 0 , as well as the index of 
refraction, n . As described in section 2.2.3, in this case the phase of a signal on a ray 
at the upper portion of the wavefront reaching this layer will advance faster than a 
ray at the lower portion due to the lower index of refraction characterizing the layer; 
both rays will experience a greater phase velocity than in the neutral atmosphere. The 
result is a ray bending toward the horizontal (the angle of refraction is greater than 
the angle of incidence) as predicted by Snells law. If the angle of refraction equals 
90°, n — smd 0 , the condition is denoted as “total refraction,” and the ray propagates 
parallel to the layer boundary. 
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Now consider the family of rays at various elevation angles as introduced in 
figure 2.8. For a ray with 6 0 close to 90°, the total refraction condition will be sat¬ 
isfied when n is only slightly less than 1; that is, when the electron density is similar to 
that occurring slightly into the increased free-electron density representing the iono¬ 
sphere. As the angle of incidence decreases; that is, the wave propagates More vertically, 
n must be smaller and smaller, and the electron density larger and larger, to satisfy this 
total refraction condition. Eventually, if n — sin 6 0 at some altitude, the wave turns 
toward Earth; otherwise, if this relationship cannot be satisfied, this wave penetrates 
the layer and goes off into space. 

In actuality, the ionosphere does not consist of such a single layer with constant 
electron density. A more accurate model could be constructed from a series of lay¬ 
ers and the path of a ray computed by integrating the effects of the multiple layers, 
thereby determining a rays trajectory through the ionosphere. Typically, the iono¬ 
sphere is characterized by an increasing electron density that increases from zero below 
the ionosphere to a maximum level and then decreases. 

Returning to the family of rays just introduced, consider how the situation changes 
as the elevation angle of a ray increases from zero. The corresponding angle of inci¬ 
dence on the ionosphere decreases, and assuming the ionospheric conditions permit 
reflection, the ray returns to Earth at a shorter distance from the transmitter. At some 
point, the angle of incidence becomes small enough so that the ray penetrates the iono¬ 
sphere and does not return to Earth. The point where the resulting communication 
range is at a minimum is called the skip distance. 

As described earlier, the ionosphere is made up of several layers, and it is the com¬ 
posite of these that determines the overall behavior. For example, if there are two 
ionospheric layers, a ray penetrating the lower layer may be reflected back to Earth by 
the higher layer; such a layer could reduce the skip distance below that determined 
by the lower layer. The minimum of these layer-by-layer skip distances becomes the 
prevailing skip distance for the overall ionospheric conditions. 

This description of skip distance has focused on the geometry for a specific wave 
frequency. But the situation is affected significantly by the value of this frequency. 
These interactions will be considered further in section 2.2.9. 

At distances greater than the skip distance, two rays can reach a receiver from a 
specific transmitter, each with its associated takeoff angle. The ray with the smaller 
takeoff angle and larger angle of incidence will encounter a reflecting electron density at 
a lower altitude than the ray with a larger takeoff angle and smaller angle of incidence. 
As a result, two rays launched from the same point may both arrive at the same receiver. 
Usually, the upper ray supports a weaker signal than the lower ray, but with Earth 
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curvature, the former may be the only ray reaching the receiver. Detailed computations 
for the transmit-receive pair and the prevailing electron-density profile are necessary 
to predict the occurrence of such situations. This situation will be discussed further 
when the effects of the Earths magnetic fields on ionospheric propagation are taken 
up (section 2.2.9). 

Under some conditions, the skip distance may be much shorter than the transmit- 
receiver range, and a ray may return to Earth well short of the receiver. Such a ray 
could be reflected upward by a ground bounce and then interact with the ionosphere 
a second time, returning to Earth at the receiver location. Such multiple bounce situ¬ 
ations become relevant at very long ranges and can allow a strong transmission to be 
received from much longer ranges than would be expected for single-hop propagation. 
These situations will not be considered in detail, but they occur often enough for the 
communicator to be aware of them. 

To be more specific, the longest single-hop communication range would typically 
occur for a takeoff angle of 0°. If ionospheric conditions in the E layer are appro¬ 
priate, the maximum one-hop propagation distance would be about 2000 km; for the 
F layer, this range would be about 4000 km. In order for communications to occur over 
longer ranges, multiple hops would be necessary, with appropriate refraction condi¬ 
tions at each point where the wave interacts with the ionosphere. It is also worth noting 
that these conditions could be substantially different on east-west paths because the 
day-night terminator, with its associated high ionization levels, may lie between the 
transmitter and the receiver. 

In this discussion of specific ionospheric situations, the usual pattern on an iono- 
gram shows a single reflection delay for a given frequency; this is the result of a total 
reflection by the lowest layer, and therefore no energy is available to propagate to a 
higher layer for reflection at its altitude. However, when sporadic E conditions exist, 
such layers are fairly thin and thus can partially reflect a wave and partially allow 
it to pass through for reflection from a higher Fi or F 2 layer. Thus, at some fre¬ 
quencies, reflections from both the sporadic E layer and the F layers above can be 
seen. 

It is also noteworthy that in situations where an upper layer reflects a ray that has 
passed through a lower layer, the interaction with the lower layer increases the time 
spent in that layer relative to equivalent free-space propagation. As a result, the virtual 
height of the upper layer is greater than it would have been had the lower layer not 
been present. 

If a communicator desires to obtain the longest-range transmission, she/he should 
use the highest frequency that will produce a reliable reflection. Note that a higher 
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Figure 2.9 Ray path variation with time of day and frequency: (a) 6 MHz, (b) 9 MHz 


frequency will not be reflected back toward Earth but will continue passing through 
the layer, albeit with a modified propagation path. 

Ray traces such as those shown in figure 2.8 can be used to show diurnal variations 
in the ionospheric propagation as a function of time of day and frequency. Figure 2.9 
illustrates this idea showing six sets of ray traces corresponding to three times of day 
and two frequencies. Note that the 6 MHz plot for 1300 UTC shows that there are two 
paths reaching 500 km ground range, corresponding to a high elevation trace refracted 
by an F layer and a low elevation trace refracted by an E layer. 

At 9 MHz, a substantial range of propagation distances is not supported by the 
ionosphere. This is especially true in the early morning, but at no time of day does 
any ray'reach distances less than 700 km for this frequency. The situation is better 
at 6 MHz, though in the morning, before sunrise, rays launched at steep angles pass 
through the ionosphere, which has limited ionization, and do not return to Earth. 
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As the sun rises and induces ionization, these rays are refracted and return to Earth 
to support propagation at shorter range. Such variations with frequency and time of 
day are captured in the Ionospheric Communications Analysis and Prediction Pro¬ 
gram (IONCAP) and Voice of America Coverage Analysis Program (VOACAP) tools 
described later in this chapter. 

2.2.7 Breit and Tuve’s Law 

Consider next the complete path of a ray launched by a transmitter at an elevation 
angle 6 ; this ray will propagate upward until it reaches the lower edge of the ionosphere. 
Assuming flat ionospheric layers, parallel to the assumed flat ground, the ray will arrive 
with an incidence angle 0 o = 90° — c. Snells law will govern the trajectory of this ray 
once it passes into the ionospheric layer. As described in section 2 . 2 . 2 , its dielectric 
properties differ from those of free space as characterized by the index of refraction, 
n . The phase velocity increases from its free-space value, c, to cjn, while the group 
velocity, with which energy travels in the ionosphere, is reduced to a lower speed nc. 

As dictated by Snells law, at a point in the ionosphere where a refraction index n 
occurs, the angle of the ray with the local vertical has changed to 0 such that 

^ sin 0 = sin 00 . (2.18) 

Once the refractive index becomes equal to sin 00 , sin0 = 1 , and thus 0 = 90°. Thus, 
the ray becomes horizontal and it begins its return toward the Earths surface where a 
receiver at location R is awaiting it. This situation is illustrated in figure 2 . 10 , where 
T represents the transmitter location, R the receiver location, and B the point where 
a ray launched at an angle e becomes horizontal. 

It is interesting to note that this ray reaches the ionosphere with a horizontal wave¬ 
length component equal to X cos e = A, sin 00 . As described earlier in section 2 . 2 . 2 , 
since P h ase planes across a boundary must be continuous, the horizontal component of 
the wavelength inside the dielectric ionosphere must also be X sin 9 0 . This logic applies 
incrementally along the ray path all the way from the transmitter to the receiver. 

In summary, a ray launched at the angle 6 from a transmitter at location T in 
figure 2.10 reaches the lower edge of the ionosphere at an incidence angle 9 0 = 90° — € 
and begins to bend, as previously described. At point B, conditions are met for the ray 
to reach its maximum height, and it subsequently returns to Earth at point R. 

Now consider the time it takes a wavefront to travel from T to B, the point where 
the ray becomes horizontal. Before reaching the ionosphere, the phase velocity is equal 
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Figure 2.10 Geometry of Breit and Tuve’s law 

to the speed of light, c , with horizontal component ccos€. As explained, this hori¬ 
zontal component remains unchanged as the ray enters the ionosphere, even though 
the ray bends toward the horizontal. With this horizontal component fixed at c cos c, 
the time for the wavefront to travel the distance DJ 2 , the distance from T to the 
point on the Earth surface directly under B, is D/2~ccos€. A similar discussion 
applies to the second half of the rays passage from T to R through point B. As a 
result, the total time for a wavefront to reach a receiver at R, a distance D from T, is 
D/(ccos€). 

Finally, consider the triangular path TCR shown in figure 2.10, assuming no 
ionospheric effects. Point C is defined by the ray launched at the angle 6 from 
the transmitter at point T and by a backward ray launched at this angle from 
the receiver at point R, a distance D away. With no ionosphere present, a wave- 
front would travel from T to C at the speed of light and reach C in a time = 
D/(2c cos c); an identical time would be required for the wavefront to travel from 
C to R. As a result, the total time to traverse the path TCR is given by = 
D/(c cos c), the same time required for the path TBR. This proves the Breit and Tuve 
theorem'. 

Theorem 1. The delay experienced by a wavefront launched at an angle 6 from the 
horizontal , traveling an ionospheric path to a receiver a distance D away\ is equal to 
D/(c cose). 

This theorem will be a critical component when geolocation techniques are covered 
in chapter 10. 



34 Chapter 2 


2.2.8 Energy Loss in the Ionosphere 

In regions where free electrons are available, they are excited by a passing wave and 
can collide with heavy neutral atoms, depositing the energy received from the wave 
and thereby causing a loss in the wave s energy. This process is known as nondeviative 
absorption . Usually, there are insufficient interactions to affect the index of refraction, 
which remains near unity. As a result, the ray path remains straight. However, if the 
collisions are frequent enough, the loss has a significant effect on the waves ampli¬ 
tude. The magnitude of the loss depends on several factors, including gas pressure, 
electron velocity due to wave interactions, and electron density. The interplay of these 
factors results in the greatest losses at lower altitudes where the gas pressure is high¬ 
est, that is, in the D region and the lower E layer. Because the electron density must 
be relatively high for the losses to be substantial, the effects of nondeviative absorp¬ 
tion are only seen in a sunlit ionosphere. Absorption is also greater if the frequency is 
lower because the average velocity attained by the electrons is inversely proportional 
to frequency. 

The energy loss can be modeled as an atmospheric conductivity, and its effect is 
an attenuation of HF signals as they traverse the ionosphere. The absorptive loss is at 
its maximum during daylight hours when free electrons are available at low altitude 
where neutral particles are dense. Also, the ionization efficiency of the illumination is 
a strong function of the solar zenith angle, making the loss dependent on time of day, 
season, and geographic location. 

Above 2 or 3 MHz, electrons will execute several vibration cycles before hitting a gas 
molecule. The resulting nondeviative absorption is an attenuation in decibels/milli¬ 
watt that is proportional to (electron density) x (gas pressure)// 2 . 

More specifically, if the electron vibration frequency is ^>l/(mean collision time), 
absorption per kilometer in decibels at a frequency/ can be modeled, assuming flat- 
Earth geometry, as — ^dB^fo)\fo / f\Y sec/, where / is the solar zenith angle, 

and f 0 is a reference frequency at which the absorption constant of proportionality has 
been measured on a vertical path; v is approximately 2. The sec / factor represents the 
longer path experienced by a ray that is not vertical. For curved Earth, the geometric 
factor represented by sec / is more complicated but is numerically <6. 

Consider a situation in which the communicator wants to utilize E layer propa¬ 
gation. From the expression for attenuation, one can see that absorption decreases as 
operating frequency f s ig increases. But if there is to be reflection from the E layer, f s jg 
must be less than /g-, the E layer critical frequency. Therefore, operators try to choose 
a frequency as close to the maximum usable frequency, /muF? as they can. Then, the 
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attenuation per kilometer in decibels is given by a = C{fg / fsig) 2 > where C is a constant 
that captures the current ionospheric conditions. 

Unfortunately measurements of D layer absorption are not readily available. How¬ 
ever, the maximum electron density in the D layer is proportional to that in the E 
layer so that published observations of the latter can be used as a clue to D layer 
absorption expectations. These will provide information regarding the variation of 
the constant C with frequency, which can then be used in the expression for a for 
frequency-management purposes. 

In additon to nondeviative absorption, there is attenuation due to deviative absorp¬ 
tion, a resonance phenomenon involving details of the ionospheric makeup and the 
wave frequency. One example occurs at high altitudes, where the plasma is too thin 
for sufficient collisions, and as a result, there is no nondeviative absorption. In these 
regions, resonant interactions become important, including electron Landau damp¬ 
ing ,, in which the HF field resonates with the thermal speed of electrons. Because these 
effects are weaker than the nondeviative absorption, they will not be discussed fur¬ 
ther. But nondeviative absorption is important up to 120 km altitude, until the neutral 
density becomes low enough to make collision-based losses insignificant. 

2.2.9 Earth Magnetic Field Effects 

Magnetic field effects are a primary cause of communication bandwidth limits for 
HF channels. As discussed earlier, the ionosphere will support multiple paths between 
a specific transmitter and receiver pair; each of these will manifest a different time 
delay and delay spread. An electromagnetic wave entering an ionized medium in the 
presence of a magnetic field, in this case that of the Earth, will be split into at least 
two propagating waves, or modes; the details depend strongly on the direction of 
the wave relative to the Earths magnetic field. Consider a wave with wavevector k; 
in general such a wavevector will be oblique to the Earths magnetic field vector Be. 
Thus, the waves E field will lie in a plane that is oblique to Be and can be resolved 
into components parallel and orthogonal to Be- Since the component that is parallel 
to Be will not be affected by the magnetic field, it is commonplace to define such a 
component as the electric field of the ordinary , or O-mode. The E field component 
that is not in the plane parallel to Be will be affected by the magnetic field and is 
denoted by the extraordinary , or A-mode. 

Figure 2.11 illustrates this geometry for a case where the wave vector k is orthogonal 
to the Earths magnetic field. In the figure, the coordinate frame is chosen such that 
the z m axis is aligned with the Earths magnetic field and is perpendicular to the page; 
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Figure 2.11 Electric field vectors and Earth magnetic field for O- and X-modes 

the x m —y m plane corresponds to the plane of the page. A wave is assumed to be 
propagating in the direction shown by the wavevector k, with its electric field vector 
lying in the plane orthogonal to k, arbitrarily shown at some instant in time. Then, 
the z m component of this wave’s E field, which is unaffected by the magnetic field, 
defines the ordinary wave. The component of E in the x m —y m plane, E*, defines the 
extraordinary wave. 

Under the assumption that the Earth’s magnetic field does not vary significantly 
in the ionospheric region where the wave is refracted, the coordinates for this reso¬ 
lution of the arriving wave will be unchanged throughout its trajectory through the 
ionosphere. If it is assumed that the polarization of the arriving wave is not linear, the 
resolution of its electric field in the Earth’s magnetic field oriented coordinate system 
into components will change with time. However, since the Earth’s magnetic field ori¬ 
entation can be assumed to be fixed over the refraction interval, the wave electric field 
component parallel to the fixed Earth magnetic field vector will continue to have the 
same linear polarization. The two orthogonal components in the plane perpendicular 
to the Earth’s magnetic field will have arbitrary amplitudes and phases, making the 
X-mode wave elliptically polarized. 

This representation of the two ionospheric propagation modes, O-mode and 
X-mode , is one of several common descriptions of the two modes that can propagate 
in a cold plasma. Another will be described after the interactions of an arriving wave 
with the ionosphere are discussed in more detail. 

With these coordinate system preliminaries out of the way, the interactions between 
a wave and the magnetic field in the propagation medium can be described. First, 
the term gyrofrequency must be introduced. It is well known that a particle bearing 
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a charge q , moving with a velocity v in a magnetic field B, experiences a Lorentz 
force F = ^vxB. This force acts on the particle, in this case the electron, causing 
its normally random motion to concentrate in the plane orthogonal to that of Be, 
oscillating at the gyrofrequency/ = eBE/27tm e , where e is the electron charge, m e is 
the electron mass, and Be is the magnitude of the Earths magnetic field.. 

These effects induce changes to the index of refraction that influence the path taken 
by a wave at frequency/ passing through the ionosphere. Letting X — ( fp/f ) 2 and 
Y = ( fg/f)> the Appleton-Hartree equation [1] can be written as 


n 


l-2X(l-X)/ 


2 ( 1 — x) -y%± JyfYY-X)Xi 


(2.19) 


Here, Yj—Y sin (9, Yi— Y cos #, and 6 is the angle between the wavevector k and the 
Earths magnetic field, Be- Thus, Ye and Yi can be regarded as components of a vector 
aligned with the wave direction, having magnitude equal to the ratio of gyrofrequency 
fg to wave frequency/; Yi is the longitudinal component, and Yj is the transverse 
component relative to the magnetic field direction. 

The Appleton-Hartree equation can be used to motivate the two modes identified 
above [9]. The + component of the ± in this equation provides the value of n appro¬ 
priate to the O-mode, while the — component gives rise to the value for the X-mode. 
The additional pair of mode identifiers, R-mode and L-mode , which are associated with 
a wavevector component parallel to the magnetic field, can also be motivated by the 
Appleton-Hartree equation. R- and L-mode values for n are obtained by noting that 
these modes correspond to a situation when the angle 0 is zero. In this case, Yj = 0, 
and Yl=Y. In such a situation, the R-mode corresponds to using the — component 
of ± and the L-mode to using the + component. 

Note especially that when the plasma is not magnetized or the Earths magnetic field 
is ignored, as was done in most of this chapter, / = 0. Then Y = Yi—Yj — 0, and 
the Appleton-Hartree equation can be reduced to the equation given earlier for the 
index of refraction, 

n 2 = \-X = \-(f p /f) 2 . (2.20) 

As was described when magnetic field effects were ignored, a propagating wave s elec¬ 
tric field vector, oscillating with the wave frequency/, perturbs the random motion 
of electrons in the plasma. The O-mode wave, with its electric field parallel to the 
magnetic field, is unaffected by the magnetic field; that is, electrons corresponding to 
the E fiekfoscillation move along the magnetic field and are not impeded by it. On 
the other hand, electrons corresponding to the X-mode E field attempt to move across 
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the Earths magnetic field, but their motion induces forces that modify the wave fields. 
As a result the X-mode fields are modified, whereas the O-mode fields are not. 

Another effect of these interactions is to modify the relationship between the group 
velocity vector and the phase velocity vector. In a vacuum, these are generally aligned 
with each other. In a cold plasma, such as the ionosphere, the group velocity vector, 
being derived from the composite of the two propagating modes, is modified and no 
longer aligned with the wave vector. 

These interactions of the original transmitted wave with the magnetically influenced 
ionosphere induce different phase velocities for the various modes, which depend 
on the plasma conditions, the Earths magnetic field, and the wave properties. As a 
result, although the originating wave may have one set of phase relationships among 
its two modes, once wave-plasma interactions have taken place, these phase relation¬ 
ships can be different. Because the received signal is the vector sum of components 
from two modes, where there was originally constructive addition, the result after pas¬ 
sage through the ionosphere is also different. The received signal can manifest fading 
and bandwidth limitations that were not present in the transmitted signal. 

Another important effect of passage through the ionized plasma is known as the 
Faraday rotation. As discussed, the various modes undergo different changes to their 
phase velocities and a plane-polarized wave will emerge with its plane of polarization 
tilted with respect to its original orientation. Because the R- and L-modes have dif¬ 
ferent phase velocities, their E field vectors will undergo different amounts of rotation 
after traversing the same distance. This induces the change on polarization plane tilt 
indicated earlier, and because the changes in phase velocity are related to the mag¬ 
netic field strength, this effect can be used to measure the magnetic field in such 
environments. 

In summary, the ionosphere acts as a lossy dielectric with properties that depend 
on the strength and orientation of the Earths magnetic field. These effects are also dif¬ 
ferent for the ordinary and extraordinary waves because these waves manifest different 
polarization states. Generally, however, below 65 km altitude, the electron density is 
too low for significant propagation loss. 

2.2.10 Transmission Curves 

So-called transmission curves are theoretically based tools that help interpret ionogram 
data in the choice of frequency for communicating a specified distance. As described 
in section 2.2.5, a vertical sounding ionogram provides direct estimates of ionospheric 
layer heights. Since the reflection is from directly overhead, the angle of incidence and 
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Figure 2.12 Normalized geometry for ionospheric communication computations 


the index of refraction are both zero. With the plasma frequency defined by^ 2 = 8W, 
the reflection condition for a potential communication path with angle of incidence 
9 0 becomes f =fp sec 9 0 . 

Now suppose it is desired to communicate using ionospheric refraction over a dis¬ 
tance d, and current ionogram data are available for the region where the link is to be 
established. It is appropriate to understand the relationship between the frequency 
axis of an ionogram and. the communication geometries supported by the current 
conditions when a particular communication frequency is used. 

Figure 2.12 provides a normalized (to Earth radius) picture of the relevant geometry. 
Basic trigonometric relationships can be used to determine the angle of incidence for 
a given communication distance d and ionospheric layer height h. 


tan 9 0 


sin (d/2R e ) 

1 + ( h/R e ) — cos {d/2R e ) 


( 2 . 21 ) 


Rearranging, the appropriate layer height can be found for a given angle of inci¬ 
dence 9 0 :-^ 

sin (d/2R e ) — [1 — cos {d/2R e )\ tan 9 0 


h = R , 


tan 9, 


0 


( 2 . 22 ) 
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Thus, one can start with a desired communication distance, assume a potential trans¬ 
mission frequency, and then compute the associated layer height as a function of the 
frequency reflected by a vertical ionosonde, recognizing that this frequency and the 
assumed transmission frequency determine the angle of incidence as specified in equa¬ 
tion (2.22). If a set of such curves is produced and overlaid on a measured ionogram, 
a range of appropriate transmission frequencies can be found. There must be an iono¬ 
gram trace intersecting a transmission curve for a given frequency if that frequency is 
to be viable. 

Figures 2.13, 2.14, and 2.15 illustrate the calculated transmission curves, and 
figure 2.16 shows how the overlay procedure provides the desired frequency-planning 
information for a transmission range of 1000 km. 

In this comparison of the theoretical transmission curves and a measured ionogram, 
one of the transmission curves will be seen to be tangent to an ionogram trace. The 
transmit frequency associated with the tangent transmission curve will be the maxi¬ 
mum usable frequency (MUF). In the example shown in figure 2.16, the MUF can be 
seen to be 6.6 MHz for a transmission distance of 1000 km. 

Alternatively, one could plot a transmission-curve series with distance as the param¬ 
eter and frequency held fixed. One of these transmission curves will be tangent to 
an ionogram trace; the distance associated with this curve is the corresponding skip 
distance. In such a situation, there is only one possible transmission path from the 
transmitter to the receiver at this frequency. 



Figure 2.13 Transmission curves for 500 km range 
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re 2.14 Transmission curves for 1000 km range 



Figure 2.15 Transmission curves for 1500 km range 


42 Chapter 2 



Figure 2.16 Transmission curves overlaid with ionogram trace 

Transmission-curve computations ignore the Earths magnetic field. Nevertheless, 
the results have been shown to be reasonably accurate in determining MUF and skip 
distance. This observation is attributable to the fact, discussed in section 2.2.9, that 
the ordinary ray path is only slightly affected by the Earths magnetic field. 

2.2.11 Propagation through a Disturbed Ionosphere 

Early in this chapter, the origins of the ionosphere were attributed to solar flux, parti¬ 
cles in the solar wind, and meteorites captured by Earth gravity. Above 65 km altitude, 
atmospheric density is low enough that the electrons freed by these sources of excitation 
remain free for extended intervals. Eventually, they are captured and neutralized by 
atmospheric ions, but as described early in this chapter, at these altitudes they remain 
free long enough to create the substantial population of free electrons, a plasma, known 
as the ionosphere. However, there are rapid variations in the arriving solar radiation 
that are especially significant in the portion of the atmosphere closest to the sun; these 
variations lead to variations in the plasma density not only temporally but also spa¬ 
tially. The result is a changing set of propagation modes for HF signals; these affect the 
performance of those communication systems that rely on ionospheric refraction to 
establish transmitter-to-receiver wave paths. 

The most common effects attributable to these events are absorption, fading, 
temporal signal distortion, spectral shifting, and frequency dispersion. In mild 
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disturbances, delay spreads of as much as 100 ms can occur, leading to potential sig¬ 
nal distortion and communication errors. Doppler shifts of several hertz are a normal 
contributor to ionospheric propagation characterization, but the Doppler spread is 
usually limited to 1 Hz or so. Under severely disturbed conditions, the time-varying 
scatter and multipath, induced by moving ionospheric irregularities along the path 
traversed from the transmitter to the receiver, can induce delay spreads of up to 1 s and 
Doppler spreads of as much as 20 Hz. Such effects can all occur on a single path. Addi¬ 
tionally, multiple paths from a single transmitter can combine coherently at a single 
receive antenna to produce multipath cancellation and enhancement effects. 

Along the day-night terminator, there is a rapid change in the plasma density, which 
persists for about an hour near sunrise as the sun first affects the upper atmosphere. 
The sudden changes induce severe effects on HF communication links. Although such 
effects occur at sunset as well, the decay in plasma density is less traumatic than the 
sudden increase at sunrise, and the effects on HF link performance are less severe. 

It is instructive to consider a simple example, such as that illustrated in figure 2.17, in 
which multipath effects produce significant effects on the received signal. This example 
ignores the presence of the Earths surface and assumes there is a direct path from the 
transmitter (T) to the receiver (R) and a reflected path off the ionosphere due to a 
ray with an elevation angle e. It is straightforward to compute the phase difference 



Transmitter Receiver 


Figure 2.17 Canonical multipath geometry 
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between signals on these two paths: 


Anfh (\ — cose 

/ = 2ir8/k = -( ■ 


sine 


If the altitude of the reflection point is changing with time, 

d'ij/ Arcf 1 — cos e 


dt 


k 


sine 


(2.23) 


(2.24) 


If h is varying approximately linearly with time, the rate of change in phase, that is, 
the frequency, is constant. This results in an amplitude modulation of the transmitted 
signal with a period T determined by the time required for the signal to go through a 
phase change of 2 it radians: 

d^lf _ 

-/• r = 27T, (2.25) 

dt 


or 



c sine 
2 fh 1 - cos e 


(2.26) 


This variation in the signal amplitude is known as fading. Because it is a function of 
the wave frequency/, it is denoted as selective fading. It should be obvious that if the 
signal is narrowband, the variations in signal amplitude across its bandwidth will be 
small, leading to the descriptive term flat fading. This condition can also be described 
by noting that when the term fh is low, the period of the amplitude change is long, 
and slow fading can be observed. Most readers will have experienced slow fading when 
listening to AM broadcasts in a moving vehicle. 

At HF frequencies, the product//? is indeed low, and these multipath effects are sig¬ 
nificant; as a result, most HF signal designs are limited to a few kilohertz in bandwidth. 
This is especially relevant for long-range communications that utilize the F layer, where 
layer altitudes can change fairly rapidly. With modern digital communication meth¬ 
ods, it will become apparent that a combination of (1) automated frequency selection 
utilizing channel probing, (2) digital signaling with a message-repeat protocol utilizing 
feedback, and (3) low data rates can lead to reliable end-to-end data transmissions. 


2.2.12 Popular Ionospheric Models 

This section summarizes several models used to predict ionospheric path loss between 
the transmitter and the receiver. 
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IONCAP. Hie Ionospheric Communications Analysis and Prediction model was 
developed by the Institute of Telecommunication Services (ITS) in the early 1970s 
to support U.S. government HF circuit operations. IONCAP is the most widely 
known predictive tool and uses semiempirically based data to approximate HF 
propagation losses between two points. Separate models are used for paths shorter 
and longer than 10,000 km. For the shorter paths, all propagation modes are con¬ 
sidered, including single and multiple hops refracted from the E and F layers. 
Superposition leads to an estimate of the total signal strength from all paths com¬ 
bined. The long-path model is less detailed with regard to losses, using a single 
coefficient as an estimate of loss per unit path length; of course, this coefficient is a 
function of frequency and diurnal factors. More details about the IONCAP model 
are summarized in [1]. 

VOACAP. With IONCAP as a starting point, the Voice of America office of the U.S. 
Information Agency developed and distributed a PC version known as VOACAP. This 
version includes a graphical user interface and provides easily used coverage diagrams 
showing propagation loss as a function of frequency and time of day Figure 2.18 
is an example of such a diagram, specifically for propagation from the Canadian 
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Figure 2.18 VOACAP example showing CFIU signal level at Lexington, Massachusetts 
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time station, CHU, to Lexington, Massachusetts. A user can easily obtain this PC 
software over the Internet. Its use is intuitive and simple. 

ICEPAC. Another upgrade to IONCAP that focuses on high-latitude propaga¬ 
tion has been developed. This version also includes various improvements to the 
calculations of circuit reliability and newer atmospheric noise model data. 
AMBCOM. Stanford Research International (SRI) has developed a two-dimensional 
ray-tracing code using a geometric optics approach to solving the wave equation. 
This code allows ionospheric tilts, such as those often observed at high latitudes, 
near the geomagnetic equator, and near the day/night transition, to be included 
in the modeling. As a result AMBCOM will predict unconventional propagation 
effects; however, it is a two-dimensional model and cannot predict effects transverse 
to the propagation direction. 

IONORAY. IONORAY is a three-dimensional ray-tracing code developed by Science 
Applications International Corporation (SAIC). It is based on five primary mod¬ 
els: the International Reference Ionosphere, a Mass Spectrometer and Incoherent 
Scatter atmospheric model, the International Geophysical Reference Field magnetic 
field model, a three-part auroral effects model, and a layers model that can assess 
the importance of sporadic E layers. With all these detailed features, IONORAY 
runs slowly, but its accuracy can justify the computer time and the effort required 
to obtain relevant input data. 

SKYCOM. The Ionospheric Support Branch of the Solar Terrestrial Dispatch devel¬ 
oped SKYCOM. Its focus is on disruptive effects on HF communications, including 
unusual geomagnetic activity, solar flares, sporadic E layers, and D layer absorption. 
SKYCOM is a high-fidelity modeling tool for sky-wave propagation predictions and 
not a communications link performance assessment tool. Nevertheless, it can be 
used to supplement a tool such as VOACAP, which does address communications 
system assessment. 

2.3 HF Noise Environment 

When designing an HF link, the signaling engineer has control over the transmitter 
power and antenna characteristics. These parameters, together with the prevalent path 
loss, determine the received signal strength. At the receiver, several sources of noise 
contribute to the determination of the received signal-to-noise ratio that establishes 
the communication link performance level. These HF noise sources include: 

• Galactic noise from celestial radio sources 

• Atmospheric noise from lightning 
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• Receiver internal noise 

• Man-made noise in the receiver operating environment 

• Cochannel HF signals utilizing frequencies close to the link operating frequency 

The levels of all of them depend on the time of day; season; geographic location; 
current sunspot number; and of course, frequency. A link designer must weigh all these 
effects in order to predict receiver performance. 

Fundamentals of receiver design show that the receiver internal noise N t can be 
easily computed from 

N t = k 0 T 0 BFL : , (2.27) 

where T 0 is the receiver operating temperature (°K), normally assumed to be 300°K; B 
is the receiver bandwidth, normally assumed to equal that of the signal; F is the receiver 
noise figure; L are receive antenna system losses; and k 0 is Boltzmanns constant. If N t 
is to be computed in units of dBm and B , F, and L are all expressed in decibel units, 
this can be written as 

N t (dBm) = -174 + B(dBHz) + F(dB) + L(dB). (2.28) 

It is worthwhile examining the bar chart of figure 2.19, which shows the relative 
magnitude of various HF antenna noise factors ( F a ) as a function of frequency [1]. 


Antenna noise figure contributions, 5 MHz 
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Antenna noise figure contributions, 10 MHz 
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Antenna noise figure contributions, 20 MHz 
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Figure 2.19 Comparison of various noise contributors in HF receivers: 7-galactic noise; 
2-rural man-made noise; 3-residential man-made noise; 4-business man-made noise; 
5-atmospheric noise, summer evening; 6-atmospheric noise, winter morning 
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Atmospheric noise is primarily due to lightning around the world. It typically 
decreases as the receiver is operated in areas away from the equator. This source of 
noise is highly dependent on time of day, season, and location. The International 
Radio Consultative Committee (CCIR) has published maps showing the geographic 
variation of the atmospheric noise throughout the world [10]. These maps are usu¬ 
ally fairly accurate in the evening. But in the morning and during the day, especially 
near built-up areas, noise is mostly man-made, and the CCIR levels are exceeded. 
Numerical models for these situations are available [11]. 

Galactic noise varies over the sky and is not evenly distributed since there is a cluster¬ 
ing of sources in the galactic plane. Such noise also varies with frequency, decreasing 
with higher frequency. Estimates of galactic noise for a particular receiver location 
must be obtained by integrating the overhead sky noise over the relevant segment 
of the celestial environment. Radio sky maps are useful as a starting point, but they 
are only available for VHF and UHF [12]; the mapped galactic noise estimate must 
be translated in frequency using the relationship A^/OC (f /f re f)~ 2 '^ and integrated 
over the antenna pattern to be useful. For the array antennas used here, the relevant 
antenna pattern in this integration is that resulting from the specific set of element 
weights employed in a given situation. Because these weights might be adaptive and 
depend on the particular signal scenario prevailing at a specific moment, it would be 
impossible to identify a precise antenna pattern to use in this integration. Instead, 
an analysis could be carried out for representative scenarios and the results used for a 
galactic noise estimate at HF. 

Interference from other HF radio transmissions is often the most challenging 
receiver design problem. Site surveys are often taken to characterize the site, but the 
results may be anecdotal because the HF operating environment is so dynamic and 
unregulated. Some attempts have been made to generalize these surveys and to provide 
a model for such interference [11,13,14,15]. 

2.4 Diurnal Variations 

As a summary of this description of the ionospheric propagation of HF radio waves, 
it is appropriate to discuss how such propagation varies as a function of frequency 
throughout the day. Before embarking on this discussion, however, it is important to 
remember that additional factors influence this propagation, in particular month of the 
year, sunspot number, and ray path location, and modulate the detailed conclusions. 
The ionospheric model VOACAP is appropriate for determining the details, and the 
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VOACAP example shown in figure 2.18 will be used as a vehicle for conducting this 
discussion. 

Note that this example is based on a sunspot number equal to 100, with a Sun- 
Earth-ray path geometry relevant to the month of June for a transmitter at Ottawa, 
Canada, and a receiver at Lexington, Massachusetts. The referenced figure shows that 
the MUF at any time of day is below 8 MHz and that there is a significant time interval, 
between 1300 UT and 2200 UT, during which the propagation losses are very high 
for any frequency. This time interval corresponds to local 0800-1700 EST, during the 
middle portion of the day when the D layer is heavily ionized and, as a result, there is 
an overriding level of attenuation. As the sun sets locally, the D layer ionization abates, 
and frequencies near 6 MHz begin to support useful propagation between these two 
sites. 

As the night progresses, lower frequencies become useful, until at 0300 EST the 
frequency of optimum transmission (FOT) drops to 4 MHz. At this time, there is a 
viable propagation path at all frequencies between 2 and 5 MHz. As the sun begins to 
rise, propagation deteriorates, until at 0800 EST propagation is largely blocked due 
to D layer absorption. 

Similar discussions for different months of the year, sunspot numbers, or ray path 
geometry could be included, but with these several key parameters influencing the 
results, there is a danger of incorrect generalizations. Instead, a reader wanting to 
develop intuition about HF channel utilization could consult a specialized text on 
that subject or, even better, could utilize the VOACAP tool and generate examples 
focused on the particular scenario of interest. 
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Common HF Modulation Protocols 


3.1 Introduction 

As indicated in the previous chapter, the ionosphere will support a link between a 
widely separated transmitter and receiver pair if the link parameters (antenna, oper¬ 
ating frequency, and so on) are properly chosen for the time of day and operating 
locations. However, the signal to be transmitted must be more than a continuous sinu¬ 
soidal tone (the carrier) that would convey no information unless it varied in some way. 
The ways in which the carrier is modified to convey information are addressed under 
the subject of modulation. Over the years since HF communications began, a number 
of signal modulation formats, as well as transmission protocols, have been tried, and 
those that provided the desired level of reliability and data rate continue to be used. 
This chapter summarizes the most common modulation protocols in use today. 

It is noteworthy that these modulation protocols were explored before the analytical 
models associated with information theoretic concepts were introduced. Shannon [1] 
revised the prevailing communication system model, identifying two-stage encoding 
and decoding processes as indicated in figure 3.1. Source data is first translated into 
a binary data stream based on a user-specified fidelity criterion, which dictates the 
differentiable number of source states and how often the source makes meaningful 
changes. This source-encoding process identifies a binary data rate R characterizing the 
source and provides the basis for the second stage, channel encoding, which translates 
the binary data stream into channel symbols appropriate to the modulation scheme 
in use. 

At the receiver, this two-stage process is reversed; channel decoding determines 
which channel symbol had been transmitted for each data epoch, reporting its 
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Figure 3.1 Generic communication system model 


decisions as a binary data stream. Then, a source decoder translates this stream into 
signals appropriate to the user. 

The channel characteristics (receiver noise level, fading rates, and so on) and symbol 
parameters (alphabet size and symbol duration) determine the capacity of the channel 
to handle data. Shannons theory specifies that as long as the source data rate is less than 
the channel capacity, appropriate channel encoding can be devised to make the over¬ 
all error rate vanishingly small. Techniques to accomplish this are beyond the scope 
of this work, but such error reduction is often based on long code words, each com¬ 
posed of multiple channel symbols. Such codes add redundancy to the data stream, 
allowing raw channel symbol errors to be detected and corrected at the expense of 
computational complexity [2]. 

This chapter describes the modulation schemes appropriate for use in the HF chan¬ 
nel regime. As was discussed at the end of section 2.2, the ionospheric medium 
supports reliable reflection/refraction propagation for HF ground-to-ground commu¬ 
nications, traditionally using a narrowband transmitted signal. It is useful to consider 
the taxonomy of the HF modulation techniques illustrated in figure 3.2. The branches 
of this diagram indicate the choices available to the communication system designer; 
each must be examined and evaluated on the basis of the type of information to be 
transmitted, the available signal bandwidth, the expected channel utilization, and the 
required communication performance, all in the context of the ionospheric propaga¬ 
tion environment. The first level in the taxonomy of figure 3.2 is based on the form 
of the information to be transferred: analog or digital. The second tier is based on the 
signal characteristic that conveys this information: amplitude, frequency, or phase. 
The third tier is only relevant to digital signals and is based on whether the signaling 
alphabet is binary or of higher order. 
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Figure 3.2 Taxonomy for HF signal modulation formats 


Historically, the earliest HF transmission format, Morse code, was motivated by a 
very simple form of available transmission equipment: a radio-frequency oscillator and 
a simple keying device. This technology was adapted directly from the wire-line tele¬ 
graph. But the desire to communicate by voice quickly led to amplitude-modulated 
broadcasts, similar to those heard today at HF frequencies in the international broad¬ 
cast bands. Digital transmissions were then developed to convey data, first with 
On-Off keying, then binary frequency-shift and phase-shift keying approaches. More 
recently, but still more than fifty years ago, M-ary modulation was developed. Today, 
an HF channel surfer can observe all of these modulation types just by tuning a 
narrowband HF broadcast receiver across a portion of the HF band. 

The most recent development regarding HF signal modulation involves wideband, 
spread-spectrum transmissions. Inspired by the development of code-division mul- 
tiple access (CDMA) technology for interference-resistant communications in the 
VHF and UHF bands, exploratory efforts to apply these methods at HF began in the 
mid-1980s [3]. Recognizing that the ionosphere supports some HF communications 
capabilities over several megahertz at any given time, albeit with frequency-dependent 
propagation characteristics, a coherent wideband signal with appropriate modulation 
might provide a robust low-data-rate 1 channel between transmitter and receiver. 

This robust channel arises because most HF interferers are truly narrowband and 
independent of each other. A signal carrying narrowband data, using a spread- 
spectrum format based on wideband pseudo-random noise (PRN) sequences known 


1 The term “low” data rate Is relative to the large bandwidth used for such transmissions. 
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to both transmitter and receiver, would manifest substantial redundancy. This format 
would allow the narrowband data to be recovered at the receiver even though signifi¬ 
cant portions of the transmission are affected by interference or channel fading. In the 
HF environment, such a signal would be relatively immune to interference and could 
be broadcast with low-enough power to avoid interfering with other concurrent users. 
Despread processing at the receiver, using the known spreading code, would provide 
sufficient processing gain to support low channel error rates despite cochannel inter¬ 
ference and varying propagation characteristics across the bandwidth occupied by the 
wideband signal. Experiments with a 1 MHz bandwidth signal have shown that this 
approach is viable [3]. Challenges associated with antenna bandwidth and interference 
to users located near the transmitter remain. 

This chapter summarizes the most common modulation approaches used for HF 
communications of both analog and digital data. It is worth noting that while the HF 
communication modality has become “old fashioned” and largely replaced by commu¬ 
nication satellite technology, it remains as a backup approach for national emergencies 
should the more modern communications media become unavailable. To assure com¬ 
patibility among manufacturers of such backup equipment, government standards for 
the various modulation techniques were developed in the 1990s. These standards are 
briefly described in section 3.6. 

3.2 Analog Signal Modulation Alternatives 

Modulation for analog information-bearing signals was developed to transmit voice 
over long distances, originally over wire but then adapted to radio. Options for such 
modulation include amplitude, frequency, and phase, but the nature of the HF chan¬ 
nel, as discussed in detail in chapter 2, makes amplitude modulation the most useful. 
This section describes the several approaches to amplitude modulation employed for 
HF transmissions. Historically, these techniques were modified for the transmission 
of digital information once a need for such data transmissions arose. This section 
reviews several common ways in which amplitude modulation is implemented to 
communicate analog signals at HF. 

These analog amplitude modulation approaches are straightforward: a modulat¬ 
ing waveform is used to specify the instantaneous amplitude of the RF carrier signal 
to be transmitted. But there are variations in the way this modulating waveform is 
applied to the RF waveform and how the result is processed after modulation but 
before transmission. First, a basic modulation process, conventional amplitude modu¬ 
lation, is described in detail; this discussion is followed by more superficial descriptions 
of several popular alternatives. 
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3.2.1 Conventional Amplitude Modulation 


Generally, a modulating waveform can be any signal with a bandwidth much less 
than the radio frequency of the carrier. Most modulating waveforms exhibit both 
positive and negative values, and the modulation scheme must account for this fea¬ 
ture somehow. Conventional amplitude modulation uses a voltage bias to assure that 
the modulating waveform is always positive. Then the offset modulating waveform is 
applied to the radio frequency carrier as a scale factor on its amplitude. 

Specifically, consider a sinusoidal carrier signal c(t) specified by 

c(t) = A 0 cos 2ixft (3.1) 


and a modulating signal a(t)\ 

a(t) = l-\- m cos 2jif m t. (3.2) 

This modulation signal exhibits a variation assumed to be much slower than that 
of the sinusoidal carrier. As long as m, the modulation index, is <1, this signal 
is positive, and it can be used to impose an amplitude variation on the carrier 
(if m — 0, there is no modulation, and the carrier amplitude remains constant at 
A 0 ). Then, for the carrier of amplitude A 0 , the signal exciting the transmit antenna 
becomes 

s(t) =,a(t)A 0 cos 2itf c t (3.3) 

=A 0 ( 1 + mcos2icf m t) cos27 xf c t. (3.4) 

This expression can be expanded using trigonometric identities to 


A 0 m r r A 0 m 

s(t) —A 0 cos 2nf c tH —— cos 2n(f c -cos 2tc(j c (3.5) 


From this expression several signal characteristics can be seen: First, there are three 
superimposed signals, one at the frequency of the carrier f c , a second at a frequency 
greater than the carrier by the modulating frequency^, and a third at a frequency 
less than the carrier by f m . These last two signals are at frequencies known as sideband 
frequencies , and these depend on the frequency of the modulating tone; they carry the 
information to be transmitted by the modulated signal. Second, the carrier amplitude 
is unchanged by the modulation index m\ even if the modulation index is zero; that 
is, no modulation, the carrier is present. Third, the amplitudes of the two sideband 
signals are equal and depend on the extent of the modulation being imposed on the 


carrier. 
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-f 0 f f-> 

c c 


Figure 3.3 Example spectrum of amplitude modulated signal: (a) modulating signal 
spectrum, (b) RF signal spectrum 

It is straightforward to show that more complex modulation, say with two or more 
modulating frequencies, will contribute additional sideband frequencies to this sim¬ 
plest case. Each modulating frequency will contribute two sideband frequencies, one 
above and one below the carrier, separated by its modulating frequency; the more 
complex modulation creates a signal cluster close to the carrier frequency, as shown in 
figure 3.3. Since the carrier is unchanged by the addition of these modulating frequen¬ 
cies, it carries no information; the information in the composite signal is concentrated 
in the sideband frequencies. 

Note that the total bandwidth occupied by the two sideband clusters, with a carrier 
in the center, equals twice the highest modulating frequency. If the information to be 
transmitted has a substantial bandwidth, the modulating frequency must be higher, 
and therefore the modulated signal will occupy a wider bandwidth. For a simple tele¬ 
graph signal at 100 words per minute, the highest modulating frequency needed is 
about 120 Hz, leading to a bandwidth of 240 Hz. Voice signals of telephone circuit 
quality might require a bandwidth of 5 kHz. 
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It is important to note that the modulation index reflects the degree to which the 
carrier is modulated. In the simple case of one modulating tone, if m = 1, the car¬ 
rier wave is completely modulated. In this case, each sideband frequency is one-half 
the amplitude of the carrier and therefore contains one-fourth the power of the car¬ 
rier. The total power in the sidebands is one-half that of the carrier (two sidebands), or 
one-third of the total signal power. For weaker modulation, this power level is reduced. 
Generally, in order to maximize the power used to transmit information, a communi¬ 
cator will want to operate with a modulation index close to unity. Note also that the 
circuitry modulating the carrier must not create distortion in the process. Such dis¬ 
tortion can arise if the degree of modulation applied to the carrier is a function of the 
modulating frequency or if the modulation envelope exhibits frequencies not present 
in the modulating waveform as a result of amplifier amplitude distortion. 

When the modulation of an amplitude-modulated signal involves multiple tones 
or a continuous spectrum, m must be chosen such that the composite modulating 
waveform a{t) does not exceed the amplitude of the carrier; that is, max\m • a(t )] < 1. 
This condition will limit the allowable power in the modulating waveform, but it 
will avoid distortion and allow simple receivers to recover the modulating waveform 
accurately. 

As will be discussed later, this conventional AM signal is inefficient in its use of 
bandwidth, requiring twice the bandwidth of the modulating signal. It is also ineffi¬ 
cient in its use of power because' the carrier contains two-thirds or more of the total 
transmitted power but carries no information. As a result of these inefficiencies, alter¬ 
native signal formats for analog information transmission have been developed and 
utilized for appropriate applications. Nevertheless, because conventional AM signals 
include a carrier in the transmission, they can be used with relatively inexpensive 
receiver hardware. 

Although any theoretical limitation to the bandwidth of an amplitude-modulated 
signal is, at HF, perhaps 1 percent of the carrier frequency, ionospheric-induced distor¬ 
tion and a crowded spectrum limit the usable channel bandwidth to a few kilohertz. 
Human speech can be reasonably clear and intelligible, but the limited channel band¬ 
width has a deleterious effect on music. Nevertheless, shortwave broadcasts have 
successfully utilized the HF channel since the 1920s using a signal, such as a{t ), that 
includes an offset from zero to assure the modulating waveform is always positive. 
Since the carrier envelope cannot be negative, a modulating waveform that creates 
such an “overmodulated” condition would produce unacceptable signal distortion. 

Because this is not a text on communications methodologies, descriptions of the 
remaining forms of analog data modulation will be presented without mathematical 
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detail. The intent is to identify those forms commonly used in the HF regime. The 
reader is directed to the references [6, 9,10] for more detailed modulation descriptions 
and relevant performance analyses that provide quantitative performance comparisons. 

In summary the form of amplitude modulation described in this section is known 
as conventional amplitude modulation ; the modulation process is merely multiplication 
of the sinusoidal carrier-waveform by the modulating signal. The resulting spectrum 
is narrowband, with information-bearing sidebands on either side of a strong carrier; 
the two sidebands exhibit shapes that are mirror images of each other, based on the 
shape of the baseband modulating waveform. As a result, they are redundant, and 
one can consider eliminating one of them. Further, the strong carrier, which contains 
the majority of the signal power, is helpful to the operation of simple receivers but 
carries no information. In the spirit of enhanced broadcast efficiency, its power can 
be reduced, or even eliminated, at the expense of more complex receiving equipment. 
These power- and spectrum-saving approaches are the motivation for the modulation 
designs described next. 

3.2.2 Double-Sideband Suppressed Carrier Modulation 

Double-sideband suppressed carrier modulation (DSB-SC) is conventional amplitude 
modulation without the presence of a carrier component in the spectrum. It is easily 
generated as the product of the analog modulation and carrier waveforms but without 
a fixed, constant amplitude component included in the modulating waveform; such 
a component is explicit in the specification of the modulation for the conventional 
amplitude modulation format of equation (3.2). 

It is apparent that the DSB-SC form of amplitude modulation is more power- 
efficient than conventional AM because it does not waste power providing a carrier. 
However, demodulation at the receiver becomes a challenge: the receiver must recon¬ 
struct an appropriate carrier signal on its own. This requirement leads to a more 
sophisticated and costly receiver architecture, which can discourage the use of this 
transmitted signal format in many applications. In addition, an error in the injected 
carrier frequency can result in distortions of the original analog signal; however, the 
signal redundancy provided by the double-sideband structure allows this problem 
to be mitigated. Further, DSB-SC modulation is no more bandwidth-efficient than 
conventional AM. 

Some transmitters use a variant of the DSB approach by transmitting a weak carrier 
along with the sidebands. Doing so is beneficial to a potential receiver because it can 
lock onto this weak carrier and use it to generate an appropriate local oscillator for 
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processing the information-bearing sidebands. The technique is used widely enough 
to earn a special name: vestigial carrier double-sideband modulation [5, 6]. 

Figure 3.4b shows the spectral characteristics of the DSB-SC signal format; for 
clarity only positive frequencies are shown. Contrast this spectrum with that shown 
for the conventional amplitude modulation signal sketched in figure 3.4a. 

3.2.3 Single-Sideband Amplitude Modulation 

In the previous section, DSB-SC modulation was introduced and shown to be 
more power-efficient than conventional AM. Nevertheless, because DSB transmits 
redundant signals, both above and below the carrier frequency, it is still bandwidth- 
inefficient. This inefficiency is addressed with single-sideband suppressed-carrier (SSB- 
SC) modulation, which is identical in format to DSB-SC modulation except that one 
of the two sidebands is filtered out and not transmitted. Because either sideband can be 
removed, the nomenclature upper sideband (USB) and lower sideband (LSB) are used to 
identify the sideband that remains. Neither single-sideband modulation format wastes 
power or bandwidth on a redundant second sideband, or on a carrier. 

SSB transmission faces a challenge with filtering because the separation between 
the upper and lower sidebands is only twice the lowest frequency in the modulation 
waveform. For most signals of interest to HF operators, this can be quite small, mak¬ 
ing the design of a sideband elimination filter difficult. Reception also has a challenge 
with respect to local oscillator regeneration, a problem that was noted in conjunction 
with DSB-SC. Flowever, without the redundancy of the double-sideband structure, 
SSB-SC receivers have no means to identify a carrier frequency error. As a result, 
demodulated signal distortion is common with SSB-SC, often observed as an incorrect 
pitch in demodulated voice signals. Nevertheless, the bandwidth and power efficiencies 
of SSB signals lead to their popularity in the FIF world. 

3.2.4 Vestigial Sideband Amplitude Modulation 

The final alternative for analog modulation of FIF carrier signals is similar to the single¬ 
sideband amplitude modulation described in the previous section. In addition to the 
single-sideband signal that efficiently carries information to the receiver, a low-level 
carrier component is also transmitted. This format provides a low-level, appropri¬ 
ately phased carrier signal that a relatively simple receiver can lock onto, thereby 
reducing the complexity of regenerating this carrier without assistance; this locally 
generated amplified carrier can be used to demodulate the sideband signal. Also, as 
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Figure 3.4 Spectral characteristics of amplitude modulation formats 
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Table 3,1 Comparison of analog modulation formats 


Modulation format 

SNR 

Bandwidth 

Carrier included 

Conventional AM 

/2N, m< 1 

2 B 

Strong 

DSB-SC AM 

A 2 c /2N 

^ 2B 

None 

SSBAM 

A 2 c /2N 

B 

None 

VSBAM 

A 2 c /2N 

B 

Weak 


mentioned earlier, the carrier and lowest frequency components of the modulating 
signal are relatively close together in frequency; as a result, the additional bandwidth 
needed to support the complete vestigial-sideband amplitude-modulated signal is not 
substantial. 

3.2.5 Analog Signal Performance Comparisons 

In this section, performance comparisons are made among the various analog signal 
formats described. Derivations of these performance measures are not included here 
because they are readily available in any of the many texts covering communication 
systems [6, 9, 10]. Nevertheless, such comparisons are useful because they influence 
the relative popularity of the various signal formats. 

These comparisons are presented in table 3.1. It is apparent that all four formats 
provide similar signal-to-noise ratios. Only conventional amplitude modulation could 
be weaker in this regard and then only if care is not taken to keep the modulation index, 
m , close to unity. Additional factors influencing the choice of modulation include 
occupied bandwidth and receiver requirements for carrier regeneration. These factors 
are also tabulated in table 3.1. 

3.3 Digital Modulation Approaches 

Turning next to the transmission of digital data, binary approaches will be covered 
first. Amplitude, frequency, and phase modulation schemes will be addressed in order. 

3.3.1 Binary Data On-Off Transmissions 

The simplest possible digital amplitude modulation approach is known as On-Off 
keying, or OOK; it is based on a single carrier that is turned on and off to convey 
information using an agreed timing protocol. In short, a pulse represents a binary 1, 
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and no pulse represents a 0. The receiver must have a reliable clock, or time base, in 
order to separate symbols and to determine when a longer pulse represents two Is in 
succession. This approach is perhaps the simplest form of several pulse code modu¬ 
lation schemes commonly used for binary data. But the on or off signaling approach 
is also used for the much larger alphabet associated with Morse code. Morse code 
transmissions are described in an appendix. 

In an OOK protocol, binary data is communicated using so-called mark and space 
signals over a specified interval T. A mark is represented by the transmission of a carrier 
with specified amplitude A, phase 0, and duration T, while a space is transmitted by 
turning the carrier off for the interval T. Thus, 



A cos(2jrf c t — 0), 



for mark 
for space. 


(3.6) 


At the receiver, the demodulator must first synchronize itself to the arriving signal 
symbol timing and then assess, for each transmit interval, whether a carrier is present or 
not. Analysis of this situation has been covered in many texts [2, 5, 6, 9,10] on impor¬ 
tant topics including phase-tracking loops, channel dynamics, and signal-to-noise 
requirements for accurate tracking. Two important situations have been identified. 
If the receiver can maintain an accurate knowledge of signal phase, using a preci¬ 
sion oscillator, coherent reception techniques can be utilized. Such receivers effectively 
perform matched-filter reception, cross-correlating the arriving signal with a carrier 
replica. Assuming that the receiver clock is perfectly synchronized with its transmitter 
counterpart, the output of the in-phase correlation filter is sampled and the result used 
for the mark-space decision. If signal phase cannot be accurately tracked, noncoherent 
receiver approaches must be used. 

In the references, it is shown that the probability of error for equiprobability mark 
and space is a function of the signal-to-noise ratio at the receiver, and if this can be 
measured, an optimum threshold can be utilized to obtain an error probability 



(3.7) 


where y is the signal-to-noise ratio, and erfc is the complementary error function. This 
error probability relationship is plotted in figure 3.5. It is worthwhile to compare the 
performance of such a coherent receiver with its noncoherent counterpart. In the latter, 
the carrier replica is not available at the receiver, and envelope detection methods must 
be used to compare the received signal expected for a mark with that expected for a 
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space. In this case, the error probability becomes 


This relationship is also plotted in figure 3.5. It is apparent from this figure—and 
can be shown by approximating the tail of the relevant distribution—that the per¬ 
formance of the coherent and noncoherent receivers, for optimum threshold settings, 
are asymptotically identical as the signal-to-noise ratio increases. This is particularly 
important in the case of receivers for HF digital communications because the iono¬ 
spheric channel effects, described in detail in the previous chapter, introduce errors 
in the carrier estimate needed at the receiver for implementation of coherent receiver 
techniques. Thus, the similar performance of coherent and noncoherent receivers at a 
high signal-to-noise ratio is particularly relevant at HF. 

These error probability predictions, however, are based on the assumption that an 
appropriate threshold can be set for the mark-versus-space decision. With OOK, this 
threshold should be set at a point that depends on the mark signal strength. How¬ 
ever, at HF signal strength is greatly subject to channel fading, which can be rapid and 
unpredictable. Even with a sophisticated automatic gain control (AGC), the receiver 




Figure 3.5 Error probability for coherent and noncoherent binary OOK receivers 
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will not be able to reliably maintain a timely estimate of the true mark signal level. 
Thus, the error rate will be greater than these mathematical predictions, which can 
only be regarded as lower bounds. This inability to maintain an appropriate threshold 
for the mark-versus-space decision has led to the consideration of alternate modu¬ 
lation approaches, in particular those associated with angle, rather than amplitude, 
modulation. Such angle modulation schemes will be described and evaluated next. 

3.3.2 Binary Frequency-Shift Keying 

Binary frequency-shift keying (FSK) modulation was developed to avoid the demod¬ 
ulation issues associated with the fading HF channel. FSK employs pulses of energy 
at two separate carrier frequencies to represent 0 and 1, respectively. Since a specific 
frequency can be viewed as a linear change in phase angle, frequency modulation is 
a special case of phase, or angle, modulation. Nevertheless, it is easy to implement an 
FSK modulator. If the frequency of a transmitting power amplifier can be adjusted by a 
tuning voltage, the modulating waveform, with appropriate scaling and/or condition¬ 
ing, can be applied to this amplifier, thereby affording a simple frequency-shift keyed 
implementation. Demodulation can be straightforward with a single discriminator 
circuit that responds differently to the two frequencies in use. 

Alternatively, a two-filter receiver can be introduced. Such a receiver employs a pair 
of filters tuned to the two carrier frequencies in use. Outputs of the two filters are com¬ 
pared, and the symbol decision is based on the filter with the larger output. Assuming 
that the two carriers used for symbol modulation are close together, with flat fading 
the channel can be assumed to be very similar for them both. Thus, there is relative 
immunity to channel fading when the two-filter receiver is used. Further, the imple¬ 
mentation is also quite simple. As a result of this simplicity—and its relative immunity 
to channel fading—FSK has become a common form of modulation for HF digital 
data transmission. 

Of course, more sophisticated alternatives to these simple modulation/demodu¬ 
lation approaches can be implemented with associated performance enhancements. 
A coherent form of FSK receiver has been used successfully. Such a receiver effec¬ 
tively implements a pair of matched filters that respond not only to the frequency 
of the received signal but also to its phase. Such filters correlate the received signal 
with the two possible received waveforms and estimate the signal energy associated 
with each hypothesis. Because of channel uncertainties, such a receiver must con¬ 
tinually revise the filters using sophisticated channel estimation processing, making 
the receiver more costly than the simpler noncoherent approaches. As a result, the 
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noncoherent two-filter receiver design is often used, and the remainder of this section 
will focus on its performance. 

Assuming that the symbols are each represented by rectangular pulses at the same 
constant amplitude, the transmitted signal becomes 



I Acos(2Ttf\t), 
A cos{2nfit ), 


for mark 
for space. 


(3.9) 


Here, f\ and fi are constant over the duration of the pulse that has duration T. 
More sophisticated transmitter systems might use pulse shaping to minimize spectral 
occupancy, but this level of detail will be ignored here. 

If the performance of a noncoherent receiver, which compares the envelope- 
detected responses from a pair of filters tuned to f and fi> assuming that each filter 
does not respond to a signal at the alternative frequency (no overlap) and that the noise 
in the unoccupied channel is independent of the noise in the other (no cross talk) is 
analyzed [6], the symbol error probability can be shown to be 



(3.10) 


for noncoherent FSK, where y is the signal-to-noise ratio. This expression for error 
probability is plotted in figure 3.6. 

Although coherence is difficult to assure in the HF propagation environment, it is 
instructive to examine an error probability expression for coherent FSK and to com¬ 
pare it with the noncoherent receiver error probability just derived (equation (3.10)). A 
coherent receiver is assumed to have replicas of the two possible transmitted signals, 
including accurate phasing, available to it; only the amplitude is unknown. The per¬ 
formance of such a coherent receiver can also be analyzed [10]; the error probability 
for coherent FSK is also plotted in figure 3.6. A comparison between coherent and 
noncoherent FSK performance can be made using this plot or alternatively, using an 
asymptotic expression for the error functions in the two cases. 

At high signal-to-noise ratio, the error probability of a coherent receiver becomes 


asymptotic to 



(3.11) 


while the same quantity for a noncoherent receiver was given in equation (3.10). Both 
have the same exponential factor. 
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Figure 3.6 Error probability for coherent and noncoherent binary FSK receivers 

TThis comparison makes it evident that the performance of coherent and non¬ 
coherent FSK are dominated by the identical exponential term when the signal-to- 
noise ratio is high. Although there is a difference in the coefficient, this difference 
becomes unimportant when the goal is to achieve very low error rates. Fortunately, 
the HF propagation channel is not well suited for coherent FSK reception, and this 
analysis shows that noncoherent FSK performance can be used successfully when very 
low error rates are the goal. 

3.4 Binary Phase-Shift Keying 

As explained in the previous section, phase-shift keying (PSK) is another form of angle 
modulation. In this section, both coherent PSK and differential PSK are described, and 
their error probabilities are compared. 

3.4.1 Coherent PSK 

Binary PSK is very simple. At the transmitter, 0 and 1 data is replaced by —1 and +1 
in the modulating data stream, and the result is multiplied by a sinusoidal carrier. The 
reader will recognize this process as a DSB-SC approach, with a bipolar modulating 
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waveform. At the receiver, cross-correlation processing is used; it involves multiplica¬ 
tion of the received waveform by a carrier replica, followed by integration over a time 
interval equal to the transmitted symbol duration. This process is identical to that 
described for coherent FSK in the previous section. 

Analysis of coherent BPSK [10] shows the error probability to be 

P e = -zv{c(ffy). (3.12) 

Thus, ideal PSK has a 3 dB advantage over coherent FSK. But these results pass over 
the important requirement of generating a perfect in-phase reference at the receiver. 
If the reference is not in phase but is in fact in error by a phase angle 8<p, then the 
in-phase channel will experience a drop in signal level equal to cos 8(j), in which case 
the error probability would be modified accordingly, 

P e = -erfc( A /y cos<$0). (3.13) 

2r 

If the phase error is small, the loss in signal-to-noise ratio is small, and the error proba¬ 
bility will not suffer excessively. Flowever, with the F1F channel, phase errors are apt to 
be unacceptable unless they are addressed using appropriate carrier-tracking circuitry. 
It is beyond the scope of this work to address the design and performance of such 
circuitry. 

3.4.2 Differential Phase-shift Keying 

An alternative to such a circuit solution employs a modified PSK signaling format 
known as differential PSK (DPSK). DPSK encodes the transmitted symbols differ¬ 
entially: if a given symbol is equal to its predecessor, the transmitted carrier phase is 
unchanged; if the two symbols are different, the transmitted phase shifts by 180°. At 
the receiver, a coherent detector is used, but the reference is the previous received pulse, 
delayed by one symbol duration. This detector design becomes the primary difference 
with respect to that used with the coherent PSK approach. With this detector, it should 
be apparent that the reference has the same level of additive noise as the received signal; 
because the received and delayed signal are separated in time by a symbol duration, 
however, these noise waveforms are uncorrelated. 

The error probability for DPSK is given by 

1 
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Comparison of this error rate for DPSK with that of coherent BPSK, equation (3.12), 
is worthwhile. As was seen in the comparison between coherent and noncoherent FSK, 
the performance of DPSK is dominated by the exponential factor, which is the same 
for both coherent PSK and DPSK. However, a detailed examination of the previous 
expressions shows that DPSK error performance does improve with signal-to-noise 
ratio with the same exponential factor, as with coherent PSK, but at low signal-to- 
noise ratio, the phase reference provided by the previous symbol is insufficient for 
reliable demodulation; as a result, there is a greater performance penalty relative to 
coherent PSK in such an environment. Details are covered in the references [2, 10]. 

3.4.3 Binary Modulation Error Probability Comparisons 

In the previous sections, several important binary modulation schemes have been 
described. It is useful to understand how they compare using error probability compu¬ 
tations as a measure of performance. Figure 3.7 provides this information in the form 
of a multicurve plot of error probability as a function of signal-to-noise ratio. Table 3.1 
contributes to this comparison by summarizing the various schemes with respect to 
their error probability at low SNR, their asymptotic error probability behavior at 
high SNR, and comments regarding their implementation for the HF propagation 
environment. 



Figure 3.7 Error probability for several binary modulation receivers 
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Table 3.2 Performance summary of binary modulation options 


Modulation 

format 

SNR(dB) for 

P e = 10~ 2 

SNR(dB) for 

P e = 10“ 6 

Comments 

Noncoherent OOK 

11.94 

17.20 


Coherent OOK 

10.34 

16.55 

Requires precise carrier 

Noncoherent FSK 

8.93 

14.19 


Coherent FSK 

7.32 

13.53 

Requires precise carrier 

DPSK 

3.92 

11.18 


Coherent PSK 

4.32 

10.53 

Requires precise carrier 


3.4.4 M-ary Modulations 

In the previous sections, two binary modulation approaches have been described. In 
each, a data source was assumed to generate a sequence of binary data values at a rate 
R bits per second and to present them to a transmitter for communication to a dis¬ 
tant receiver. In this section, the limitation to binary modulation will be removed, 
but the assumption of a binary data source will be retained. This limitation concern¬ 
ing the source may seem overly restrictive, but it is well known that most, if not all, 
data sources can be encoded with any desired accuracy into such binary streams at 
an appropriately chosen binary symbol rate. Therefore, it will be convenient to limit 
discussions here to such binary data sources. 

In a binary modulation approach, data is transmitted over a communications chan¬ 
nel using two symbols, one to represent each of the binary data bits arriving from 
the source. Consider grouping k successive binary data bits into <c super symbols,” 
each of which is represented on the channel by its own unique modulation sym¬ 
bol. Since k binary data bits admit M = distinct possibilities, the modulation 
scheme must allow for M symbols. Such schemes are considered in the subject M-ary 
modulation. 

The topic of M-ary modulation is fairly complex because there are many modulation 
options and a variety of approaches to evaluating them. Basic questions about a specific 
modulation scheme include: 

• How much energy does each symbol require for reliable transmission? 

• What channel bandwidth is required? 

• What is an appropriate value of M ? 
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• What is the complexity of the associated transmitter and receiver? 

• Can coherent processing be implemented? Is it feasible? Useful? 

System design tradeoffs abound. This section can only serve as an introduction 
to this complex topic but will focus on the two most common M-ary modulation 
approaches found in HF communication systems today, M-ary phase-shift keying 
(MPSK) and M-ary frequency-shift keying (MFSK). Attention will be paid to per¬ 
formance as a function of signal-to-noise ratio, to bandwidth requirements, and to 
the effects of increasing M, the size of the symbol alphabet. 

3.4.4.1 M-ary Phase-Shift Keying 

Binary phase-shift keying was defined as the transmission of two symbols differing in 
phase by 180°. If the first symbol is based on a cosine wave, the second is also a cosine 
wave but with a negative sign. Both these symbols utilize in-phase components, and the 
quadrature components go unused. Thus, it is possible to design a 4-ary symbol alpha¬ 
bet utilizing both in-phase and quadrature signals and obtaining two symbols from the 
positive and negative cosine wave and two more from the positive and negative orthog¬ 
onal sine wave. The consequence is a relatively simple receiver design and an attractive 
performance characteristic. This approach suggests a generalization for M > 4. 

Define a set of symbols, all with constant amplitude but differing in phase by 50 = 
360°/Af. For simplicity, assume the first symbol is a pulse from a positive cosine wave. 
In the phase plane, such a symbol can be represented by a positive in-phase vector with 
a magnitude equal to the envelope of the wave. The second symbol can be represented 
by a vector of the same magnitude but at an angle 8(f). Generally, the i — th symbol 
can be represented by a vector of this magnitude and a phase angle 0 Z - = (i — 1)50. For 
example, the phases of the symbols for an 8-ary system become: 0°, 45°, 90°, 135°, 
180°, 225°, 270°, and 315°. Figure 3.8 shows these vectors in the phase plane. 

Performance predictions for such a signal involve the numerical integration of the 
appropriate probability distributions. These are based on the usual models for in-phase 
and quadrature components of a signal observed with an additive, white Gaussian 
noise disturbance. There is no loss in generality by considering the errors associated 
with the symbol assigned to a cosine wave with zero phase. For a coherent receiver, 
this symbol will be phase aligned with the in-phase channel reference. Assuming 
M -ary phase-shift keyed modulation, a correct symbol will be obtained if the phase 
of the demodulated signal lies in the phase sector of width 2tz/M> centered on zero 
phase; that is, if the phase of the received signal is estimated to be ±7r / M . 
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Figure 3.8 Phase plane diagram of 8-ary PSK vectors 


Following [6], one can obtain'an expression for the probability density function of 
the output phase, p{6 ), from such a receiver and then numerically integrate it over an 
appropriate sector to obtain the probability of correct reception, P c ; the probability of 
error follows by the usual P e = \ — P c . The results of such integration are presented in 
figure 3.9 for M = 4, 8, and 16. 

It is worthwhile pointing out that a symbol error does not necessarily lead to errors 
in all k bits. This issue will be addressed further after another form of Aftary signaling, 
Aft ary FSK, is described. 

3.4.4.2 M-ary Frequency-Shift Keying 

In section 3.3.2, binary FSK modulation was introduced. With BFSK, two tones 
are used, and a 0 is signaled by one of them while a 1 is signaled by the other. 
Aft ary frequency-shift keying (MFSK) is an extension of that idea: M tones are used 
to signal Aft different symbols representing k = log 2 M bits. It is important that the 
tones be adequately spaced. If the symbol duration is T and the tones are at least \/T 
apart in frequency, they will be orthogonal, and a simple filter-based receiver can be 
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Error probability for M-ary PSK 



Figure 3.9 Error probability for M-ary PSK modulation 

implemented to demodulate the received signal. If the tones are closer together, the 
tone that causes a peak response in one filter will cause some response in an adjacent 
filter; this cross talk will increase the error rate. 

Discussion here will be limited to this simple implementation; the received sig¬ 
nal is fed to M filters, each of which is tuned to one of the symbol tones; an 
envelope detector is used at the output of each filter. Based on the detector out¬ 
put, a decision as to which symbol was transmitted is made. This is a noncoherent 
receiver. 

An equivalent coherent receiver could be built using M filters, each matched to 
a different symbol tone. But such a receiver requires that M coherent references be 
implemented, and each of them must track the frequency and phase of one potential 
transmitted tone. Because of this complexity, a coherent MFSK receiver is relatively 
unsuited for use at HF; it will not be discussed further. 

Analysis of the noncoherent receiver for MFSK signals is straightforward. M — l of 
the filters are responding to inputs that are noise alone; only one of the filters receives 
a tone. The remaining M — 1 filters are receiving noise alone, and their outputs are 
independent of each other and of the output of the filter receiving the current symbol 
tone. The probability of error can be obtained [6] by considering the probability that 
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M-ary FSK error probability 



the envelope at one of the incorrect symbol filters exceeds the output at the filter with 
-a tone input. 

This expression is plotted in figure 3.10 for several values of M. 

3.4.4.3 M-ary Modulation System Comparisons 

Now that these two forms of Af-ary modulation have been described and characterized, 
it is possible to compare the trade-offs associated with each in a common scenario, 
communicating the output of a source that generates data at a fixed rate R. With 
MFSK, the duration of a symbol can then increase in proportion to k = log 2 M. This 
means, of course, that the symbol signal-to-noise ratio y increases in proportional 
to k as well, implying, since the signal-to-noise ratio per bit yy equals y / k, that the 
signal-to-noise per bit remains constant. 

One can show that the error probability can be upper bounded by 

^ p < ]_2 k e~ Xo ^' lMyb ^ 2 = - e ~ lo &2 M (Yb/ 2 -\n2) 

e ~ 2 2 


(3.15) 
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Thus, as long as y > 2 In 2, the probability of error can be reduced to any level by 
increasing M. This is a well-known result due to Shannon. The cost is an increase in 
bandwidth because the MFSK tones must maintain a spacing equal to the inverse of 
their duration, which is inversely proportional to k. Since there are M = 2^ tones, the 
occupied bandwidth, relative to communication rate, becomes 2^/ k, an exponential 
growth as a function of the number of bits carried by a symbol. In addition to band¬ 
width, there is a cost associated with the receiver, which also requires M = 2 k filters, the 
number of which must increase exponentially. Modern digital receivers can mitigate 
this cost using fast Fourier transform (FFT) methods, but detailed implementation 
and spectral availability studies would be required to determine whether the reduction 
in error probability would be worth the costs. 

It is appropriate to compare the MPSK alternative. In that case, increasing the num¬ 
ber of bits per symbol k —and therefore increasing the duration of the symbol—allows 
the total signal bandwidth to decrease by a factor of k = log 2 M. Flowever, the req¬ 
uisite signal-to-noise ratio per bit, increases as M 2 / log 2 M. Receiver complexity 
need not grow rapidly with M. Thus, the cost tradeoffs are different with MFSK and 
MPSK. 

Before leaving the topic of M -ary modulation, it is appropriate to return to an 
earlier comment regarding symbol versus bit errors. If M > 2, each symbol carries 
more than one bit. If the communicator s goal is to transmit a binary data stream, a 
symbol error does not lead to errors in all bits carried by that symbol. The assumption 
is often made that when an M -ary symbol is in error, the decoder randomly selects 
the k bits associated with that symbol. Consider one of these bit positions. Since there 
are M = 2^ possible binary bit sequences, of which one is correct and the other 2^ — 1 
are incorrect, half of the incorrect sequences will be in error in the bit position under 
consideration. Thus, the average probability of bit error P e jy is related to the probability 
of symbol error P e by 


2 k /2 1 P e 1 

Peb = -r— =- —r ~ -Pi 

2 k -\ 2 \-{\/ 2 k ) 2 


(3.16) 


where the approximation becomes very close for large k. 

To conclude the discussion of modulation options, it is important to emphasize a 
point that has already been made several times. In selecting a modulation approach for 
a specific application, there are a number of critical tradeoffs to consider and various 
constraints to be observed. Information rate, available bandwidth, acceptable bit error 
rate, operating range, channel characteristics, receiver complexity, and practical levels 
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of transmitter power are some of the important factors. And there are others. Thus, 
the design of a modulation system involves multidimensional trades that cannot be 
easily generalized or summarized in a few words. Each situation must be addressed as 
a specific design problem with its own appropriate solution. 

•*. 

3.5 HF Modem Design 

The design of a receiver appropriate for communication over an HF channel must 
deal with two concerns. The first focuses on the demodulation of the signal format to 
be received, while the second addresses any synchronization needed to assure that the 
receiver time and frequency references are compatible and synchronized with those 
of the transmitter. Both of these concerns are greatly influenced by the character of 
a signal that has propagated via an HF link, especially any Doppler shift or channel- 
induced spreading, as well as the modulation features associated with that particular 
signal. Any significant motion of transmitter and/or receiver will also be important. 
This section discusses some of these issues briefly. Synchronization is not included 
because it is a complex subject well addressed by Proakis [10]. 

3.5.1 Critical Channel Factors 

Earlier sections of this chapter presented details concerning several modulation 
approaches used for HF communications and indicated how signal-to-noise ratio con¬ 
tributes to bit error performance in a Gaussian noise channel. But the modulation is 
only one of several components contributing to a reliable HF modem, and at HF the 
channel is not well modeled by simple additive Gaussian noise. Signal fading, chan¬ 
nel dispersion, and cochannel interference are some of the issues facing the modem 
designer. Potential design elements include limited symbol bandwidth, data inter¬ 
leaving, error detection/correction coding, and channel protocols. This section merely 
attempts to make the reader aware of how these modem design features contribute to 
performance at HF. A thorough discourse on any of these topics is well beyond the 
scope of this book. 

HF channel conditions have been widely studied. To facilitate evaluation under 
standardized conditions, the CCIR has recognized the variability of the HF channel 
by defining three sets of conditions, denoted “Good/ 5 “Moderate,” and “Poor,” on the 
basis of thechannel delay spread and the fade rate. These are summarized in table 3.3. 
Even under “Good” conditions, the channel can introduce delay spreads of as much 
as 0.5 ms, which requires channel compensation if this channel characteristic is not 
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Table 3.3 CCIR standardized channel characteristics 


Characteristic 

Good 

Moderate 

Poor 

Time delay 

0.5 ms 

1 ms 

2 ms 

Fade rate 

0.1 Hz 

0.5 Hz 

2 Hz 


to limit usable channel bandwidth. Historically, the HF channel has been limited 
to about 2500 Hz, and delay spread has been the motivation. More recently, mod¬ 
ern modem design approaches have allowed for channel equalization using training 
sequences, and these features have allowed data rates to increase, but the occupied 
bandwidth has remained about the same. 

3.5.2 Channel Error Correction Coding 

Forward error correction (FEC) coding [10] is applied to the incoming binary data 
stream using a coding scheme that reflects the relatively long correlation time that 
the HF channel will impose on the received data. Redundancy bits, which are deter¬ 
mined in a way that allows the occurrence of channel errors to be detected—and to a 
limited extent corrected—are added to the data stream. Following this encoding pro¬ 
cess, in recognition of the channel delay and slow fading, both of which introduce 
symbol dependencies into the received data stream, modem designers have included 
interleaving in the modulation structure; the goal of interleaving is to make succes¬ 
sive channel symbols independent, a requirement for the error detection/correction 
coding to perform effectively. 

Interleaving is particularly relevant for signals transmitted over an HF channel 
because of the fading characteristics of this environment. When interleaving is used, 
digital data is shuffled prior to encoding for transmission. As a result, short-term cor¬ 
relations among channel symbols, introduced by the propagation characteristics, are 
removed by the deinterleaving process. Then, the error correction decoding can oper¬ 
ate on data that have more temporally uniform statistics, as such coding is designed to 
handle. 

Interleaving is implemented by loading successive binary data bits into the rows 
of a memory organized as a matrix, with the number of columns chosen to reflect 
the expected channel decorrelation time and the number of rows equal to the 
number of bits associated with a channel symbol; then, the bits are read out of 
the memory columnwise, each column producing a channel symbol. This process 
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leads to independent channel symbol statistics and (we hope) to the desired code 
performance. 

At the receiver, a similar matrix-oriented buffer is used to reconstruct the origi¬ 
nal data stream. The interleaving spreads out the effect of a short burst of cochannel 
interference onto several successive channel code words rather than affecting several 
successive bits in the same code word. As a result, the FEC, which might have been 
designed to correct one bit, can do so, and the resulting data stream reaches the user 
error-free. The penalty for including FEC and interleaving is a user delay that might 
be unacceptable if it is too long. This is particularly relevant to applications involving 
direct human interactions, such as interactive Internet access. Users are unhappy if the 
result of a keystroke occurs after a relatively long time, perhaps 0.2 s. 

Channel protocol can be a critical component to modem design because it can be 
implemented to provide an error-free communication link, albeit with a channel delay 
that might be unacceptable in some circumstances. Such protocols usually refer to the 
handling of a data stream that is segmented into blocks, each of which is handled inde¬ 
pendently. Each block is transmitted with a sequence number, and the receiver applies 
error detection using the parity check bits incorporated into the block; it then verifies 
that the arriving blocks follow the sequence. If a block is determined to contain an 
error, or if the block is missing, the receiver utilizes a reverse channel to request that 
the block be retransmitted. To assure proper synchronization, the proper receipt of 
each block is acknowledged. When channel conditions are such that no errors occur, 
the link operates with no delay. But when errors induce a retransmission, there is an 
additional delay of one round-trip time per repeat; if channel conditions are partic¬ 
ularly poor, multiple repetitions of many blocks might be necessary, compromising 
the channel throughput significantly. This is the price of error-free communication 

at HE 

3.6 Government Standards 

Many operations of modern government depend on reliable communications. Both 
international and homeland defense, response to natural disasters, and reliable trans¬ 
portation are the most obvious areas where this is true, and there are many others. 
Today, wire line and satellite communications are the primary approaches to pro¬ 
viding the necessary reliable links. But there is a requirement for backup should 
these media fail. The central role of government in these communication require¬ 
ments has encouraged the development of national standards for HF communications 
with a view toward assuring the compatibility of units designed and built by different 
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manufacturers. This section briefly outlines two of these standards and discusses the 
efforts that led to their particular features. 

The most well known of these standards is MIL-STD-188-110(A,B) [19]. It specifies 
in detail four types of modems applicable to HF operation. In addition, there is a sec¬ 
ond important standard, MIL-STD-188-l4l(A,B,C) [20], which details appropriate 
signaling protocols for use in the HF band, including the automatic link establishment 
protocol. This section briefly outlines these standards. 

3.6.1 Standardized Modem Signals 

Although the historical basis for HF communications was rooted in manual operations 
for both data keying and linking protocols, the shortage of experienced and skilled 
operators drove the development of mechanized approaches in both areas. Technical 
design decisions were supported by the then recent theoretical approaches embodied 
in the work of Shannon [1, 6, 9, 2, 7] and many others. This work identified the 
Nyquist rate as an upper bound on performance and led to a variety of new modulation 
approaches that allowed this bound to be approached. It also showed how diversity 
and forward error detection/correction coding schemes could be employed to cope 
with noise bursts and rapid fades. Success in these development activities focused the 
standards on specific modem designs that had proven effective in providing reliable 
link performance. 

3.6.1.1 8-ary Phase-Shift Keyed Modem Signal 

One of the designs covered in the standard uses a single-tone format employing coher¬ 
ent 8-ary phase-shift keying (8PSK) for modulation. It operates at 2400 baud 2 with 
each constant phase symbol lasting 1 -r- 2400 = 0.4167 ms. In addition to the data, the 
modem transmits training sequences that occupy one-third of the channel symbols. 
These training sequences are used to equalize the channel and to thereby reduce the 
effects of varying channel dispersion. A rate 1/2, constraint-length-seven error con¬ 
trol code is used. With the 8-PSK modulation, 7200 binary bits per second (bps) are 
transmitted over the channel, but as indicated, one-third of these are training bits; as 
a result, 4800 bps are available to provide data and error control. Thus, with the rate 
1/2 code, the net channel rate becomes 2400 bps right in line with many other HF 
modems. 


2 Defined here as symbols per second. 
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One problem with this design is the breakdown that occurs when the training bits 
do not lead to accurate equalization of the current channel conditions. In such a situa¬ 
tion, channel distortion dominates reception, and a burst of data errors occurs despite 
the FEC code. Although the training sequences and a strong convolutional FEC code 
are included in the design, the basic waveform, is a phase-modulated constant fre¬ 
quency signal. Consequently, a constant envelope approach to array processing would 
be suitable for reception by an array-equipped sensor. 

3.6.1.2 8-ary Frequency-Shift Keyed Modem Signal 

A second MIL-STD design is a simple 8-ary FSK signal that could be implemented 
with a bank of eight analog filters. This approach is much slower: a 125 baud design 
with symbol duration of 8.0 ms. The 8-ary modulation results in a 375 bps data rate, 
and though inadequate for extensive message traffic, it is suitable for link establishment 
control bursts. Because the eight tones are adequately spaced in frequency, at twice the 
baud rate (250 Hz spacing) there is minimal cross talk between the frequencies. More 
importantly, from the perspective of a multichannel sensor, this spacing is adequate to 
assure the signal is constant envelope, allowing this signal feature to be exploited when 
prosecuting this FSK modem signal with an adaptive antenna array. 

3.6.1.3 Parallel Sixteen-Tone Modem Signals 

The remaining two MIL-STD designs are both of the same parallel-tone modem class, 
aimed at achieving at least a 2400 bps data rate in a 2700 Hz bandwidth HF channel. 
Such a channel manifests substantial delay spread due to multipath in the ionosphere. 
As discussed in chapter^ delay spreads can easily exceed the 0.42 ms associated with a 
2400 symbols-per-second (BPSK or BFSK) baud rate. Original designs of this class uti¬ 
lized multiple low-data-rate subchannels, with the subchannel baud rate chosen such 
that the symbol duration exceeds the delay spread. Even though all subchannel signals 
could be transmitted simultaneously, each subchannel could be processed separately 
after narrowband matched filtering at the receiver. More modern modem designs uti¬ 
lize a frequency spacing that is the inverse of the symbol period leading to orthogonal 
tones. Then fast Fourier transform methods can be used at the receiver to implement 
a simple demodulator. A guard time is used at the start of each symbol epoch to avoid 
interference from tails of the previous symbol. 

Both sixteen-tone and thirty nine-tone signal designs are included in the standard. 
In the sixteen-tone design, each of the tones employs differential 4-ary PS K at 75 baud, 
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for a subchannel raw data rate of 150 bps. Each tone has a null-to-null bandwidth of 
150 Hz, allowing all sixteen tones to fit within the 2700 Hz channel bandwidth 
(16 x 150 = 2400 Hz) as long as they utilize a 150 Hz tone spacing. A 4.2 ms guard 
time is used in receiver processing. No error control coding is used, and as a result, the 
bit error rate is fairly high. With proper spacing at 150 Hz, the tones are orthogonal, 
which simplifies the decoding process, but the envelope of the composite signal no 
longer has the constant envelope feature. 

3.6.1.4 Parallel Thirty Nine-Tone Modem Signals 

More recently a 39-tone signal has been developed and incorporated into the 
standard. It also employs differential quadrature phase-shift keying (DQPSK) sub¬ 
channel modulation, using a lower baud rate of 44.44 symbols per second. 
Sub-channels are spaced at 88.88 Hz so that thirty nine of them occupy about 
3500 Hz. A guard time of 3.5 ms and FEC are employed to achieve a net data rate 
of 2400 bps. The performance of this modem is better than with the sixteen-tone 
modem, but comparative evaluations of the 16-tone and 39-tone modems show 
relative performance to depend on the operating point; that is, the desired bit 
error performance and the associated signal-to-noise ratio, as well as the channel 
conditions. 

3.6.2 Modem Performance Comparisons 

Reference [14] presents a detailed summary of several papers that compare these 
modems, both under simulated CCIR conditions and with measurements of their 
operation over an actual HF channel across Australia in 1993. Under CCIR “Good” 
channel conditions, Pennington [15] showed results suggesting little performance dif¬ 
ference among the three modems when the signal-to-noise ratio was less than 25 dB; 
above 30 dB SNR, the single-tone modem provided the lowest error probability by an 
order of magnitude. For CCIR “Poor” channel conditions and low SNR, the sixteen- 
tone modem provided lower error probability, but the 16-tone modem improved 
substantially at 20 dB and above. These results can be explained by the lack of error 
control coding in the sixteen-tone modem; at low SNR, the FEC coding of the two 
other modems fails, inducing a disproportionate number of errors. In the on-air test¬ 
ing, error performance was characterized as a function of the interference environment, 
quantified by interference burst rate; in such situations, interleaving became the decid¬ 
ing factor. In all these situations, the operating point-—and the environment in which 
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the modem is to operate—determined which modem design should be regarded as 
superior. 

After these evaluations, interest in improved performance led to studies attempt¬ 
ing to understand the relative contributions of the basic modulation, interleaving, 
and error control code. Cook [16] evaluated the channel symbol error rates of the 
single-tone and sixteen-tone modems prior to the decoding processor. These results 
showed that the multitone modem provided the better performance, but since this 
modem does not include FEC, its overall performance fell short of the simpler single¬ 
tone design. Such conclusions motivated the application of trellis-coded modulation 
approaches designed to maximize the euclidean distance among code symbols. An 
outgrowth was the development of the TCM-16 modem, which incorporates a spe¬ 
cific error correction code 3 and a convolutional interleaver [17]. In CCIR channels, 
this modem has been reported to provide excellent results. 

The success of TCM-16 in simulation has led to more advanced designs that more 
fully utilize the available channel bandwidth. Australians Defence Science and Tech¬ 
nology Organisation (DSTO) developed a 52-tone modem manifesting an end-to-end 
data rate of 4800 bps. Gill [18] presented the results of its performance on the CCIR 
“Good” channel compared with a single-tone modem at half the data rate; with no 
interleaving, for the same error rate, 2 dB less signal-to-noise ratio was required despite 
the increased data rate. When approximately 10 s of interleaving was included, the 
SNR requirements appeared to be equivalent, although one should not forget the data- 
rate difference. Attempts to compare performance with equivalent data rates led to test 
results indicating a 5 dB advantage for the 52-tone modem when approximately 10 s 
of interleaving is utilized. 

3.6.3 Automatic Link Establishment (ALE) 

A second government standard, MIL-STD-188-l4l(A,B,C) [20] focuses on the chan¬ 
nel protocols used for HF communications. This includes many details concerning 
channel use, including the automatic link establishment (ALE) protocol that has 
become important in todays digital communications era. This section provides a brief 
outline of ALE and gives some indication of the communications characteristics that 
can be expected when ALE is used. 

The ALE protocol evolved from the strictly manual approach to HF commu¬ 
nications that has been used since the beginning of long-haul radio. Because link 


3 256-state Schlegal-Costello code 
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quality is so uncertain from the two perspectives of ionospheric support between a 
transmitter/receiver pair and the presence of cochannel interference on any specific 
frequency, it has always been necessary to avoid strict frequency assignments. Instead, 
users have a list of frequencies at which they might transmit; control software operates 
the radio equipment in order to maintain routing tables that indicate which frequen¬ 
cies can provide a satisfactory link under current conditions. This process is facilitated 
using probe transmissions—more on them later. 

ALE embraces a standardized word format and a set of eight basic message formats. 
The ALE word consists of 24 information bits segmented into four parts, a 3 bit pream¬ 
ble and three 7 bit character fields. This word is split into two 12 bit halves, and each 
half is encoded using the well-known (12, 3) Golay code into two 24 bit Golay words; 
the two Golay words are then bit-wise interleaved. Finally, a so-called stuff bit is added, 
making a total of 49 bits from the original 24. As a last contributor to error con¬ 
trol, the 49 bit encoded ALE word is repeated three times before it is modulated for 
channel transmission. It is noteworthy that since the ALE bit rate is 375 bps, the dura¬ 
tion of the ALE word transmission is 3 x 49/375 = 392 ms, all for 24 information 
bits! 

As indicated, there are three preamble bits used to specify one of eight messages. 
These are: 

1. TO , used to identify the first three characters of the destination station address. 
Additional address characters can be appended using the DATA and REP 
messages below. 

2. TIS , used to identify the first three characters of the sending station address. 
The address can also be extended, as with the TO message. It is used at the end 
of a transmission when a dialogue is in process. 

3. TWAS , used in the same way as TIS but signals the end of a dialog. 

4. DATA , used to extend the data field of a previous message or to convey 
information in a message. It is used in conjunction with REP below. 

5. REP , used to generate extended messages by duplicating the function of the 
previous message while transmitting different message content. 

6. CMP , used as a command to facilitate coordination and control. 

7. THRU , used in the scanning of a calling cycle with group calls. 

8. FROM , used to identify the address of the transmitting station early in a long 
ALE frame. Note that the calling station ID is otherwise only contained in the 
TIS and TWAS messages, which may come long after the message start. 
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As a side note, the motivation for the stuff bit should be mentioned. Many of the 
modems used to implement ALE use 8-ary or 16-ary modulation. Without this bit, 
the same modulation symbol (the same tone in the case of MFSK) would be used 
to transmit a given bit in each of the three data-redundant repetitions. Inserting this 
bit causes symbol boundaries to precess through the message, thereby adding some 
diversity to the modulation scheme. 

These eight ALE message types can be used to construct many varieties of ALE 
frames and thereby implement a wide variety of operating protocols. In fact, such 
protocols have been developed to implement a number of networking protocols, such 
as those included in the government standard MIL-STD-187-721, which covers the 
network layer of the overall communications protocol. Although a thorough discussion 
of this topic is well beyond the scope of this work, it is appropriate to describe two 
basic protocol components, channel evaluation and call initiation. 

Channel evaluation is essential to all ALE protocol implementation because it pro¬ 
vides a framework for a station to initiate communication with a high probability of 
success. All ALE controllers are required to maintain a database that enables them to 
select frequencies with good channel support for communication with potential mes¬ 
sage recipients. Such databases are built in two ways, scanning and sounding. Scanning 
is implemented by stations not actively engaged in a communications dialogue. Each 
station is provided with a list of assigned frequencies for ALE transmissions. If this 
set is well chosen, at least one of the frequencies will be suitable for transmission to 
any other station with which it might want to communicate. Such an inactive station 
should scan its receiver over its frequency list, dwelling long enough to capture trans¬ 
missions by candidate communicants. Such messages may or may not be intended for 
the inactive station. If intended for it, the station can respond and become active; if 
not, the station can update its database, entering signal quality information to assist 
in frequency selection when it desires to transmit to the station corresponding to the 
heard message. 

Sounding consists of brief signal transmissions using an ALE structure to pro¬ 
vide empirical testing of potential channels and propagation paths. These messages 
are effectively self-identifying beacons that should be transmitted periodically by all 
stations. Other stations receiving these sounding messages can use them to supplant 
the database entries obtained with scanning. In all these cases, the receiving station 
can monitor channel conditions, interference, distortion, and so on, using standard 
ALE signals to provide link quality analysis (LQA) that will assist in later frequency 
selections. Often, the message used for such sounding is the TWAS , which explicitly 
indicates that no reply is expected. 
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Call initiation is relatively straightforward. The caller begins the process by selecting 
an appropriate frequency using the database just described. After checking for interfer¬ 
ence on the channel, the caller transmits a single-channel call. If the recipient is known 
to be listening on the chosen channel, perhaps by arrangement, the caller can send a 
very brief TO message with the called station address, followed by the TIS message, 
which expects a response. Otherwise, a substantial number of the TO messages pre¬ 
cede the TIS message, in hopes that the called station will capture one of them in its 
scanning. The calling station then waits for a response. The wait must be long enough 
for the called station to receive the initiating message, to tune its transmitter to the 
received message frequency, and to reply. If the original calling station does not receive 
a reply after this wait, the call is deemed to have failed, and the process is repeated, 
likely with a different calling frequency. However, if the called station does reply, the 
two stations are now communicating, and the desired message traffic can begin. 

3.7 Summary 

This chapter has described the more important modulation approaches used for com¬ 
munications at HF. Modulation methods appropriate for both data and analog signals 
have been included. With this material, as well as the summary of HF propagation phe¬ 
nomenology provided in the previous chapter, it is appropriate to address the principal 
focus of this book: the architecture and processing techniques associated with adap¬ 
tive array processing that have proven to be effective when receiving signals in the 
congested HF environment. 

3.8 Appendix: Morse Code 

Before leaving the amplitude modulation topic, it is appropriate to discuss Morse code, 
perhaps the best-known OOK protocol that has great historical significance [12]. This 
encoding approach is intended for the communication of text messages rather than 
binary encoded data and uses a larger symbol alphabet than does OOK. But it does 
employ an On-Off keyed transmitter. Morse code can be viewed as an approximately 
50 character alphabet utilizing five channel symbols: space, dot, dash, letter gap, and 
word gap. This code utilizes a sequence of short and long tones, referred to as dots 
and dashes , to represent letters of the alphabet; a longer sequence of concatenated 
letter sequences produces words. Relative timing is standardized such that a long tone, 
the dash, is three times longer than a short tone, the dot; an off interval, the space, 
equal in duration to that of a short tone, separates the tones in a letter, and an off 
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Letter 

Code 

Letter 

Code 

A 

• 

N 

• 

B 

— • • • 

0 

— 

C 

• 

P 

• • 

D 

■— • • 

Q 


E 

• 

R 

• • 

F 

• • — • 

S 

• • • 

G 

— • 

T 

— 

H 

• • • • 

U 

• • 

1 

• • 

V 

• • • 

J 

• — 

w 


K 

— • — 

X 


L 

• — • • 

Y 

— • 

M 

— 

z 

• • 


Figure 3.11 Structure of International Morse Code 


interval, the letter gap, equal to that of a long tone, separates the letters. A much 
longer off interval, the word gap, equal in duration to seven short tones, separates 
words. 

The set of characters used with International Morse Code is shown in figure 3.11. 
Note that this set includes the English alphabet, ten numbers, and various punctuation 
characters. Variations for languages other than English have been devised. 

The basic unit of timing is therefore the duration of a dot, the short tone. Trans¬ 
mission speed is controlled by adjusting the duration of the dot using the standardized 
word PARIS , which requires 50 dot units; a standardized transmission speed of twenty 
words per minute (wpm), or 1000 dots per minute, or 1000/60 dots per second, means 
that the dot duration is 60 ms. An appropriate bandwidth to assure clearly discernible 
Morse code is approximately equal to 100 Hz. 

These timing characteristics are most relevant to machine Morse transmissions. 
Most often, however, Morse code messaging is “manual Morse” keyed by a telecommu¬ 
nications operator; in such situations, although the relative timing must be maintained 
for intelligibility, there is wide variation in the absolute keying rate. Keying can be rel¬ 
atively slow; today, the lowest level of proficiency recognized by the American Radio 
Relay League, which monitors radio operator licensing in the United States, specifies a 
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keying speed of 5 wpm. However, Morse code keying capability is no longer required 
for an amateur radio operator license. Amateur radio operators have moved into the 
digital era and utilize computer-based digital transmissions with a wide variety of sig¬ 
nal formats [13]. It is noteworthy, however, that in the era when Morse code was the 
primary method for radio transmissions, keying competitions were held regularly, and 
the record for manual keying was officially 75 wpm. Thus, there is a wide variation in 
the signaling speed and consequently, in the Morse code signal bandwidth. 

Predictions of error rates for Morse code are complicated by the size of the char¬ 
acter set and the grammatical structure associated with the code design. Nevertheless, 
trained operators are able to “read Morse” at high speed even when the signal level is 
poor. With the introduction of digital modem technology, interest in Morse code has 
fallen off, and there appears to be little literature on collecting link performance data. 
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HF Receiver Architectural Approaches 


4.1 Overview 

Although the adaptive array processing of HF signals concentrates on the baseband 
processing of signal data captured by an array of HF antenna elements, a descrip¬ 
tion of the radio frequency (RF) portions of the receivers that provide this data is 
appropriate. This discussion will of necessity be brief; it primarily aims to describe 
the basic architectural options and to put all readers on a common footing in view of 
technological advances in recent years. Particular attention will be paid to the special 
receiver features associated with the multielement array application and the character¬ 
istics of modern analog-to-digital converters (ADCs), which translate the analog radio 
frequency signals to the digital world. 

This chapter will start with a brief description of four alternative receiver architec¬ 
tures, from the receiver input at the antenna port to the ADC, where the digital world 
takes over from its analog counterpart. Receiver issues, such as image control, dynamic 
range, and sensitivity, will be touched upon. An effort will be made to highlight design 
issues associated with maintaining coherence among the signals captured by a bank of 
receivers intended to operate with an array of antenna elements; such coherence is 
mandatory if a multichannel receiver is to supply appropriately conditioned signals to 
an adaptive array processor, thereby allowing this processor to provide the interference 
suppression performance promised by adaptive array techniques. 

Emphasis will be on multiple carrier HF receiver applications. This terminology 
is an import from the cell phone world in which a single base station implementa¬ 
tion receives simultaneous signals from many handsets operating on different carrier 
frequencies. Some cellular base station designs even allow the base station to operate 
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with several different signaling protocols. Furthermore, the substantial investment in 
a base station and the frequent revisions of cell phone signaling protocols motivates 
a flexible software-based processing system that can change easily without hardware 
replacement. In the HF world, whereas traditional receiver designs were intended to 
process a single signal arriving at a single receiver antenna from a distant HF transmit¬ 
ter, many applications motivating this book deal with the simultaneous reception of 
multiple signals, each using a different signaling protocol. As a result of these multi¬ 
signal requirements, the RF segment of a modern HF receiver must process wider 
bandwidths and address the high dynamic range issues associated with multiple sig¬ 
nals having different power levels as well as different carrier frequencies anywhere in 
the overall operational receiver bandwidth. 

4.2 Architecture Options 
4.2.1 Overview 

This section concentrates on the general architecture of an HF receiver from the input, 
which is assumed to be directly connected to the antenna output, to the ADC input. 
This portion of the receiver is analog in nature and often presents severe design chal¬ 
lenges. It must take the RF signal and condition it appropriately for digitization by 
the ADC. This generally means that the signal presented to the ADC must be at an 
appropriate amplitude for the system ADC—not too high so as to cause clipping or 
overranging nor so low as to cover only a portion of the ADCs digitization range, 
thereby losing receiver dynamic range. This requirement means that the arriving sig¬ 
nals must be amplified and/or attenuated appropriately, with the associated analog 
gain adjusted to provide the most advantageous signal range to the ADC. These latter 
requirements are usually addressed as part of the automatic gain control (AGC) design, 
a topic to be covered in this chapter as well. 

In the discussion to follow, it is important to recognize that many applications 
require phase information to be preserved and available for subsequent processing. In 
such cases, the receiver must produce coherent in-phase and quadrature outputs and 
is in effect a two-channel receiver with coherent quadrature local oscillators. As each 
receiver approach is presented, the additional complexity associated with producing 
these two quadrature channels will be discussed. 

Four receiver approaches will be presented [4], starting with the traditional 
superheterodyne receiver intended for a single narrowband signal. This description 
is included for background, providing a reference for comparisons with more 
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sophisticated designs appropriate to multiple carrier applications. Next, a single¬ 
channel receiver based on a digital intermediate frequency (IF) will be described, 
serving as a lead-in to the discussion of multicarrier superheterodyne receivers, the 
third approach to be presented. Finally, modern direct sampling receivers will be 
described. This approach has only become practicable in recent years as ADC technol¬ 
ogy has reached a level capable of supporting interesting performance specifications. 
Such a receiver can be set up to provide two quadrature data streams or a single complex 
data stream for subsequent digital processing. 

Each of these architectures will be described briefly, covering the overall frequency 
plan and its effects on image control. Because HF receivers operate in frequency bands 
that are crowded with signals, other signals at so-called image frequencies can be a 
major source of interference. Such interference can possibly be mitigated by an appro¬ 
priate adaptive beamformer but at the cost of reduced degrees of freedom for other 
in-band interferers. It is better to control these images by means of a carefully cho¬ 
sen frequency plan. Issues associated with such a plan are covered for each of the 
architectures to be described. 

4.2.2 Narrowband Superheterodyne Receiver 

This basic receiver architecture will be covered first because it provides a historical 
and technical foundation on which to address subsequent design options. However, 
in many applications its cost and performance factors are less attractive than those of 
more modern HF receiver design approaches. Nevertheless, its presentation is useful in 
highlighting some issues that must be addressed with the more sophisticated designs, 
in particular those associated with image control, AGC implementation, and coher¬ 
ence. In additon, for some applications, the superheterodyne receiver is still the most 
attractive option. 

Figure 4.1 illustrates a simplified design for the basic single-channel superhetero¬ 
dyne receiver. This design is narrowband and originally was aimed at receiving a single 
signal at an approximately known frequency, with potential Doppler shifts due to iono¬ 
spheric interactions or platform motion as the only sources of carrier uncertainty. The 
first step in the receiver plan involves limiting the frequency range reaching the first 
amplification stage. Although the antenna itself is likely to manifest a band-limiting 
filtering action, this antenna characteristic is poor or insufficient for most applications. 
Thus, the receiver will include a band-limiting filter specifically designed for the appli¬ 
cation, inserted prior to the first stage of amplification and implemented with a 
low-noise amplifier. This filter eliminates any strong out-of-band signals that might 
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Figure 4.1 Basic single-channel superheterodyne receiver block diagram 

be impinging on the antenna; if not filtered out, such signals could overwhelm the 
dynamic range of the low-noise amplifier. 

A tuning mixer follows the band-limiting filter, with the reference frequency 
selected to place the signal of interest squarely in the center of the intermediate fre¬ 
quency (IF) bandwidth; for a single signal receiver, the IF is chosen to match the signal 
bandwidth with sufficient margin to accommodate any carrier uncertainty. As will be 
addressed in the presentation of multiple carrier receivers, if the receiver is designed to 
capture multiple signals in contiguous frequency channels, the IF bandwidth must be 
made sufficiently wide to cover the intended set of signal bandwidths. An IF filter fol¬ 
lows the mixer; this filter rejects not-of-interest mixer products. These include signals 
at twice the carrier frequency but also include signals close to the signal of interest, 
which are not adequately attenuated by the band-limiting filter in the front-end. An 
IF amplifier follows the IF filter, providing not only isolation from the baseband stages 
but also implementing AGC. 

The final major analog section of this basic receiver converts the IF signal to its 
quadrature baseband components. This is accomplished with a pair of quadrature mix¬ 
ers fed by a fixed local oscillator that generates two quadrature references at a frequency 
equal to the IF. These feed a pair of mixers, each of which processes the same IF signal 
after it has been split into two identical IF copies. The outputs of these two mixers are 
the in-phase and quadrature baseband channels; each of these is passed through a low 
pass antialiasing filter to reject the double-IF frequency components. 

Finally, the in-phase and quadrature channels are passed to a demodulation section. 
Although this processing was traditionally performed with analog circuitry, more mod¬ 
ern receivers digitize these signals with separate baseband ADCs, and the resulting 
digital baseband signals serve as input to a digital signal-processing (DSP) section. 
Details of the baseband demodulation processing depend on the modulation employed 
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by the signal of interest; these demodulation details will be covered further in a later 
chapter. Today, however, even relatively low-cost receivers avoid separate in-phase and 
quadrature channels and employ direct digitization of IF signals, as discussed in the 
next subsection. 

It is appropriate here to comment about two aspects of a receiver that is intended 
for use in an adaptive antenna array application. As mentioned earlier, a separate 
receiver is needed for each antenna in the array. First, it is essential that these receiver 
channels are fully coherent with one another. That implies that all local oscillators 
are derived from the same source and the ADCs are all clocked simultaneously. 
Such coherence requires great care in the local oscillator signal generation and in 
the distribution of these local oscillator signals to their destinations on known, or 
preferably equal-length, paths. Second, the baseband processor can apply adaptive 
phase and amplitude weights to signals in the various array channels and coherently 
combine them, providing a beamforming capability that will be described in great 
detail later in this volume. This beamforming is only possible under the conditions 
of full coherence and digitizing simultaneity. Since beamforming is one of the pri¬ 
mary motivations for adaptive array processing, the importance of coherence cannot 
be overemphasized. 

In summary, the basic receiver described here uses a single frequency conversion 
stage before quadrature sampling. Often, additional up- or down-conversion stages 
are used to improve filtering and image rejection. As will be discussed in section 4.2.5, 
the simplest receiver architecture employs front-end filtering, followed directly with 
quadrature filtering. 

4.2.3 Digital IF Receiver 

t 

The second receiver architecture to be described is also intended for a single signal, 
but it exploits ADC and digital signal-processing technology that has been available 
since the 1980s, digitally sampling the intermediate frequency signal with a single ADC 
sampling at an appropriate rate and processing this single digital stream to produce the 
in-phase and quadrature signals needed for demodulation. The principal advantages 
of this approach are the accuracy with which the quadrature mixers can be imple¬ 
mented using digital signal-processing methods and the highly selective filters that 
can be implemented in the digital signal-processing domain. Although this receiver 
approach provided the signals for the first demonstration of adaptive array processing 
at HF, the costs of a multichannel receiver that can only process one signal at a time 
are unlikely to be justifiable. Nevertheless, this design approach is a stepping stone to 
the multiple-carrier receivers described in the next section. 
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Figure 4.2 Single-channel digital IF receiver block diagram 


Figure 4.2 shows this approach schematically. The band select filter, mixer to IF, 
and IF passband filter are used in a way that is similar to their use in a superhetero¬ 
dyne design, but the intermediate frequency is chosen in conjunction with the desired 
sampling rate and signal bandwidth to allow an efficient DSP implementation of 
bandpass sampling. Then, a controllable gain amplifier, which implements an AGC, 
provides an appropriately scaled signal to a single ADC, the sample rate of which 
is chosen carefully as just indicated. In particular, if the intermediate frequency is 
chosen to be an odd quarter (e.g., one-quarter, three-quarters, or five-quarters) of 
the sample rate, a pair of quadrature mixers can be implemented as simple multi¬ 
plication by zeros and plus/minus ones, simplifying the desired down-conversion to 
baseband [4], The outputs of this pair of mixers are known as the digital in-phase and 
quadrature, or digital I/Q channels of the receiver. This approach can provide perfor¬ 
mance that is superior to that of a superheterodyne design due to an effectively perfect 
match between in-phase and quadrature mixing implemented through digital signal 
processing. 

Although the focus in this section is a receiver for a single signal, this approach to 
receiver design is fundamental to modern FFF receiver design. For this reason, it is 
appropriate to delve into this receiver approach in some detail. Consider the signal 

x(t) — a{t) cos[2nf c t + <p(t) ], (4.1) 

where both a{t) and are assumed to be low-pass with one-sided bandwidth equal 
to B/2. Using basic trigonometric relations, this can be written as 

x(t) —xj(t) cos{2rcf c t) — Xq(t) sm(27tf c t), (4.2) 


where 


Xd{t)=a(t) co s[0(f)] 


( 4 . 3 ) 







i : 
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is the so-called in-phase component, and 


Xq{t) = a(t) sm[(/)(t)] 

(4.4) 

is the so-called quadrature component. These components allow 
be defined: 

a complex envelope to 

x(t) = xj(t) + ix q (t) 

(4.5) 

such that 


x(t)=Re[x(t)e i2n ^l 

(4.6) 


where i — \f— I. Because both the direct and quadrature signals are real and low-pass 
with bandwidth 5/2, each can be represented by a set of samples at a sample rate at 
least equal to B. Since there are two signals to be sampled, the total sample rate must 
be at least 2B, which is consistent with the well-known sampling theorem. 

The digital IF receiver described in this section employs a frequency downshift that 
places the bandpass spectrum entirely on the positive frequency axis but as close to 
the frequency origin as possible. Since it has been assumed that the one-sided band¬ 
width of the RF signal is equal to 5/2, a downshift using a mixer and a local oscillator 
with a frequency equal to f c — 5/2 will be employed. Figure 4.3a shows the origi¬ 
nal signal spectrum, and figure 4.3b shows the signal spectrum after the downshift. 
This process is implemented in the same way that the shift to IF is implemented in a 



(a) RF spectrum 



-f 0 f |F = B/2 f 

c ,r c 


(b) Digital IF signal spectrum 


Figure 4.3 Spectral illustration of digital IF receiver 
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superheterodyne receiver, with a single mixer and a low-pass filter. The resulting down¬ 
shifted signal is quadrature sampled using a single ADC that samples at a rate equal 
to f s — 4(5/2) = 25. The motivation for this sample rate will be derived next. 

The bandpass signal x(t) defined in equation (4.1), when mixed with a local 
oscillator signal designed for the desired downshift, is given by 

x(t) — a{t) cos[2nf c t (p (t)\ cos[2n(f c — B/2)t] 

= [a(t)/2]{cos[4jvf c t — 2 tt (B/2) t + <fi (t) \ + cos[2jr(B/2)t + 0(^)]}. (4.7) 

When this digital IF signal is low-pass filtered, the first term is suppressed, leaving 

x(t) — [a(t)/2]{cos[2jr(B/2)t + (j)(t)]}. (4.8) 

This signal is sampled at a rate equal to four times the IF frequency; that is at f s — 
4(B/2) = 25, providing a sample stream given by 

s x (n) = [a(t)/2]{cos[27t(B/2)n/f s + 

= [a(t)/2 ]{cos[2tt(B/ 2)n/f s ] cos[0(^)] — sin[2jr(B/2)n/f s \ sin[0(^)]} 

— [a(t)/2]{cos[mt/2] cos [ 0 (^)] — sin[7OT/2] sin[0 (£)]}. (4.9) 


Observe that the samples corresponding to even n capture the in-phase compo¬ 
nent of the signal, while the samples corresponding to odd n capture the quadrature 
component. Hence, 


I [a(t)/2]{cos[rm/2] cos[0C)]} = \x^{t) for n even 

i (4d0) 

— [a(t)/Z]{cos[(n-\r 1)tv/2\ sin[0(/■)]} = \xq(t) for n odd. 

Noting that the argument of the first trigonometric term is an even or odd multiple 
of 7r/2, this can be written 


|UW/2]{(-l) 7 ' /2 cos[0OO]} = \xd(t) for n even 

l-[4f)/2]{(-l) ( " +1)/2) sin[0(O]} = W,t) for n odd. 



Referring back to equation (4.3), it is apparent that the direct components of a complex 
signal representation of x(t) are provided by the even samples, while the quadrature 
components are provided by the odd samples, with appropriate use of ±1 in both cases. 

These narrowband, single-signal receiver descriptions provide helpful background 
for the following discussion of two important wideband receivers for multiple-signal 
reception. 
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4.2.4 Multiple-Carrier Superheterodyne Receivers 

A superheterodyne receiver for the reception of multiple signals is fundamentally sim¬ 
ilar to the same receiver type designed for a single signal; the principal difference is 
the receiver bandwidth. In order to provide coverage of multiple signals, this receiver 
characteristic must be expanded from a few to several hundred kilohertz or more. 

Figure 4.4 shows this architectural approach schematically. A comparison with 
figure 4.2 shows that the primary difference is that the sampling clock rate and the 
corresponding intermediate frequency are determined by the total signal bandwidth, 
rather than by the single-signal bandwidth. Because the multiple-signal receiver is 
intended to cover a fairly large number of HF signals with one tuner setting, this means 
that the digital IF is now on the order of 0.5 MHz or more (rather than 2-3 kHz), and 
the sampling frequency is four times this frequency. In-phase and quadrature sampling 
streams are provided for each narrowband subchannel by the DSP section. 

Such an expansion in operating bandwidth brings on significant design challenges, 
primarily with respect to the filtering. The necessary filters must not only be wide, 
but they must have steep skirts in order to adequately limit interference from nearby 
unwanted frequency channels. Further, and perhaps most importantly, filters from 
all antenna channels must be closely matched and equalized. Often, this matching 
is provided by a digital filter designed specifically for each channel to correct for any 
channel-to-channel differences. Because they will appear to be spatial, any such differ¬ 
ences, if unequalized, will compromise the level of interference suppression achievable 
through adaptive beamforming. 

As a result of these design challenges, the analog HF system designer faces these 
conflicts: 1) choose a single design to be used anywhere in the HF band, accepting the 
bandwidth limitation just outlined, with the implication that many such receivers will 
be needed to cover the complete HF band; or 2) utilize several different designs, each 



Figure 4.4 Multiple signal digital IF receiver block diagram 









98 Chapter 4 


• \ : % ' v . . ' •. 

V v. •' "• ■ '! ’7/-’ '/ , /.••H ' 

<.> c - '-** •■••V-v.'-v.v'-.' vh-'V *>*.• ••X'' -i- - *.• 1 .. . • • 

- ,--- - , - ► 

0 1 2 3 4 f c (MHz) 

Figure 4.5 Bandwidth-limiting constraints for a digital IF receiver 

with a challenging and costly high-quality analog filter requirement. Remember that 
the set of receivers needed to cover the intended HF bandwidth must be replicated 
to cover the set of antennas needed to support adaptive antenna processing; this fac¬ 
tor adds stressful matching requirements to the filter specification. Practically, as the 
bandwidth increases beyond a few hundred kilohertz, the analog filtering requirement 
becomes so difficult it is more advantageous to switch to an architecture that utilizes 
digital processing to an even greater degree. 

In summary, the high cost of good-quality analog filters, the difficulty in matching 
their frequency response (in both amplitude and phase), and the desire to miniaturize 
the RF receiver portion of the entire HF multichannel receiver system all argue for an 
RF design that maximizes the digital processing. Such an approach is the topic of the 
fourth receiver architecture to be described, the direct-conversion digital receiver. 

4.2.5 Direct-Conversion Receiver 

When adaptive array antenna technology was first applied to HF communications 
signal reception, high-dynamic range analog-to-digital converter (ADC) technology 
was limited to dynamic ranges of about 85 dB (14 bits) at sample rates in the range of 
5-10 Msps. More recently, however, these characteristics have improved to the point 
where dynamic range is close to 100 dB (16 bits) at sample rates exceeding 100 Msps. 
Such capabilities have led to the development of a direct conversion receiver for HF 
applications. Progress in ADC technology is detailed further in section 4.4.6. 

Figure 4.6 shows this approach schematically. This receiver architecture consists of 
a band-limiting filter to restrict signal inputs to those in the HF band or most often to 
a portion of this band. This filtering is critical if the receiver is not to be overloaded by a 
single strong signal, which is outside the band of interest, impinging on the system 
antenna. Following the filter, a high-dynamic range amplifier increases the signal level, 
conditioning it to an appropriate level for the ADC; this amplifier should provide an 
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Figure 4.6 Multiple signal direct conversion digital receiver block diagram 

adjustable gain capability in order to implement an AGC subsystem. Often, balanced 
analog circuits are used prior to the digitizing stage to suppress even order harmon¬ 
ics and distortion. The ADC is clocked at a high-enough rate to satisfy the Nyquist 
criterion for the RF signal; that is, at least twice the highest HF carrier frequency of 
30 MHz, allowing for filter rolloff; an ADC clock at 80-100 MHz is often used. The 
ADC samples are then transferred to the digital signal-processing section, where they 
are appropriately processed to develop the desired signal-baseband products. 

The simplicity of this architecture—and its reliance on digital signal processing— 
brings several important advantages. As with the digital IF architecture introduced 
in section 4.2.3, there is no requirement for well-matched quadrature analog mixers; 
in fact no analog mixers of any kind are required. Further, there are no intermedi¬ 
ate frequencies with their associated filtering components, eliminating a major source 
of antenna channel mismatch and unwanted images that compromise performance in 
adaptive antenna receiver configurations. Despite these advantages, care must be taken 
with the band-limiting filter design in order to control undesirable effects due to in- 
band harmonics. Such harmonics are more of an issue with HF receivers because 
the desired receiver-processing bandwidth can be on the order of the RF carrier 
frequency. 

Bandpass filtering is one of several issues that can be addressed jointly under the 
rubric of frequency control. Four issues dominate the discussion: 

1. Control of interference from strong unwanted signals and harmonics 

2. ADC performance 
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3. System data transfer and storage capacity 

4. Digital signal-processing capacity 

To understand the interrelationships among these issues, consider a design based on 
an ADC that can digitize at 80 Msps. Such a device could directly provide a stream of 
samples adequate to cover the full HF band from 2 to 30 MHz. If the only bandpass 
filtering was designed for the full HF bandwidth, a strong signal anywhere in the HF 
environment could dominate the ADC input range, thereby limiting system sensitivity 
over the full HF band. The AGC approach must address this issue. On the other hand, 
if the HF band were divided into several subbands using a bank of adjacent filters each 
several Megahertz wide, a strong signal in one of the subbands would not compromise 
performance in the other bands. In such a design, a separate ADC would be devoted 
to each subband filter output. 

Generally, there would be a trade in selecting the number of subbands between the 
higher costs associated with many subbands, each requiring a subband amplifier, ADC 
and DSP, and the greater penalty associated with losing an entire subband due to strong 
signal interference if there were few subbands. The system DSP capacity would play 
into this design trade because the subbands in effect define an independent data acqui¬ 
sition system covering the frequency extent determined by the subband bandwidth. In 
each subband, one must digitally handle the full rate ADC output using digital filter- 
decimate processing and then transfer the signal data covering the subband bandwidth 
to a data store and/or a second stage of digital signal processing for further massaging. 
The subband DSP must be able to handle this processing load, which grows linearly 
with the subband bandwidth. 

4.3 Image Control 

Before moving away from the receiver architecture discussion, it is appropriate to 
define and discuss image frequencies, which are an important consideration in super¬ 
heterodyne receiver design [4]. First, what is an image frequency ? Recall that a mixer 
produces two outputs, one at the sum of the two input frequencies and one at the 
difference of these two frequencies. A typical receiver will use filtering to select one 
of these at its IF, whether that be an analog or digital IF. If a mixer is fed with a local 
oscillator at frequency^ and the output is chosen by means of a bandpass filter cen¬ 
tered at frequency f/p, f 0 is chosen (tuned) to place a desired signal^ within the IF 
bandwidth. Suppose this is done such that fy —f 0 —flF • If there is another undesired 
signal present in the environment,^, such that f u +f 0 —flF > then bothj^ and fy will 
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fall in the IF passband, with the undesired signal causing interference to the desired 
signal. This issue must be controlled by assuring that the undesired signal does not 
reach the mixer with sufficient strength to cause objectionable interference. 

This section briefly reviews each of the four receiver architectures from the stand¬ 
point of interference due to signals at the image frequency. 

4.3.1 Narrowband Superheterodyne Receiver Images 

To clarify the issues associated with image control, consider the specifics of the 
most common narrowband superheterodyne receiver, the ubiquitous AM radio. This 
receiver is designed to receive amplitude^-modulated signals with a carrier frequency in 
the range of 540-1650 kHz. Such receivers exploit the wide availability of standardized 
IF filters having audio signal bandwidth and tuned to a center frequency of 455 kHz. 
In order to shift the carrier to this IF, the radio utilizes a local oscillator that can be 
tuned over the range of 995 to 2105 kHz. Consider the situation when the desired 
signal is at 540 kHz and suppose there is also a signal in the environment at 1450 kHz. 
The local oscillator is set to 995 khz to shift the desired signal to the IF at 455 kHz. 
Because 1450 — 995 = 455, the mixer will generate two signals at 455 kHz, one due to 
the desired signal with a carrier at 540 kHz and another due to the interfering signal 
at 1450 kHz. This signal at 1450 kHz is known as an image; it must be controlled by 
appropriate filtering if the radio is to have acceptable performance. One can follow 
this reasoning further to determine that there is a range of image frequencies corre¬ 
sponding to the range of local oscillator tuning frequencies; this range of images is 
1450-2560 kHz. 

Of course, some of these frequencies fall outside the AM broadcast band, and as 
a result there may be no signals present to generate image-based interference. Never¬ 
theless, there is no quarantee that this is the case; it is more prudent to include image 
control in the design so that an image signal does not reach the mixer should such 
a signal be present. The approach taken to control such image signals in AM broad¬ 
cast band radios is a tunable bandpass filter tuned to the frequency of the desired 
signal as part of the RF amplification stage. Because a judicious choice of IF makes the 
image frequency in such a narrowband superheterodyne receiver fairly well separated 
from the desired frequency, its specifications are relatively straightforward to realize in 
practice. 

This approach can be modified to handle the range of frequencies associated with 
HF signals in a narrowband superheterodyne receiver. Because HF signals are generally 
narrowband and are assumed to have a bandwidth of approximately 3 kHz; that is, 
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quite narrow relative to the range of HF carrier frequencies, there are two options for 
the frequency plan. One is to place the local oscillator frequency below the carrier, 
and the other is to place it above the carrier. Suppose the 455 KHz IF from the AM 
radio domain is utilized to take advantage of the inexpensive IF filters available for 
AM broadcast receivers. A local oscillator below the carrier would tune from 1.545 to 
29.545 MHz, putting the range of images from 1.090 to 29.090. This implies that a 
channel select filter at RF would be at least 0.91 MHz from the desired signal, which 
in this case is narrowband with a bandwidth of approximately 3 kHz. Such channel 
select filtering presents no design problems. 

The alternative option, with the local oscillator above the desired carrier and the 
same IF of 455 kHz, implies that the local oscillator tunes from 2.455 to 30.455 MHz; 
image frequencies range from 2.910 to 30.910 MHz. Again, an RF channel select filter 
should not be a problem. The two narrowband superheterodyne receiver frequency 
plans are illustrated in figure 4.7. 

A more modern approach to HF communication receiver design is to place the 
first IF above the entire HF band. At their input, such receivers use a low-pass fil¬ 
ter, typically called a “roofing” filter, with a cutoff of 30 MHz, to reject images. This 
avoids the need for tuned bandpass filters, which are difficult to align and equal¬ 
ize. Figure 4.8 shows this approach, which is very commonly used in amateur HF 
receivers. 




Figure 4.7 Image frequencies for narrowband superheterodyne receiver architecture 
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Figure 4.8 High IF receiver architecture. 


4.3.2 Digital IF Receiver Images 

In a narrowband single-signal receiver having down-conversion in the first IF mixer 
and using a digital IF, the RF channel selection filter becomes a challenge; this is 
because the narrow bandwidth of the signal puts the local oscillator and the desired 
signal frequencies close together. In particular, the digital IF is at half the signal band¬ 
width (B/ 2), or at 2.5 kHz, say. This means that the local oscillator would be at the 
desired carrier frequency.±2.5 kHz; for a desired receive frequency of 5.000 MHz, the 
local oscillator would be at either 5.0025 or 4.9975. But then the image frequency 
would fall at 5.0050 or 4.9950. A channel select filter with a center frequency at 
5.000 MHz and a rejection null 5 kHz away would be nearly impossible to realize. 
Thus, a digital IF receiver that directly shifts to the digital IF frequency would severely 
suffer from image rejection shortcomings. 

Such issues could be avoided by adopting two IF stages, the first of which would 
mirror the frequency plan of the narrowband superheterodyne receiver covered in 
section 4.3.1. In this case, the first IF stage might use the same IF frequency as the 
narrowband superheterodyne receiver, but the second IF stage would adopt the digital 
IF approach. Thus, the receiver would employ a second local oscillator to shift the 
signal from the first IF to the second. If the first IF were at 455 kHz, for example, the 
second local oscillator would be at 452.5 kHz (or 457.5 kHz). Because the IF signal 
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paths are entirely contained within the receiver, there is little danger of a sufficiently 
strong image signal (at 450 kHz or 460 kHz) reaching the second mixer. In this way, 
the digital IF receiver architecture could be preserved without concern over image 
frequency interference. 

4.3.3 Multiple-Carrier Superheterodyne Receiver Images 

When the system requirements dictate that many contiguous carriers are to be captured 
by a single superheterodyne receiver, the increase in target bandwidth changes the 
frequency plan options. In particular, there is insufficient spectral space between the 
low end of the HF band at 2 MHz and a center frequency appropriate for IF sampling. 
This forces the frequency plan to use an IF above the HF band, perhaps at 100 MHz 
or even higher. Such a high IF reduces the challenges in filter design and provides an 
opportunity for excellent control over image frequency interference. 

In particular, suppose an IF at 50 MHz was chosen. The local oscillator would tune 
between 52 and 80 MHz to cover the HF band; the corresponding image frequencies 
would fall in the range 102 to 130 MHz, well separated from the HF band. Alterna¬ 
tively, the local oscillator could be placed above the IF, tuning over the range 102 to 
130 MHz, with image frequencies between 202 and 230 MHz. Both options are well 
away from the HF band, allowing a low-pass filter with a cut-off somewhat above 
30 MHz to be used for image frequency control. 

4.3.4 Direct-Conversion Receiver Images 

Because image frequency issues arise from undesired mixing products, the direct con¬ 
version receiver, which has no mixers, is relatively immune to the concerns addressed 
for the other receiver architectures. Nevertheless, the presence of out-of-band RF sig¬ 
nals at the ADC input would result in undesired interference. Thus, it is important 
that the HF band filtering be done well, even if not because of image frequency prob¬ 
lems. Subsequent digital filtering in the DSP section of the receiver would have little 
value in suppressing this interference, which may be aliased from outside the HF band 
into the digitized signal by the ADC. 

4.4 Analog-to-Digital Converters: Issues and Performance 

ADCs provide the interface between the traditional analog receiver world and 
the digital signal-processing approach that dominates HF receiver design today. 
Enhancements in processor capability and coincident improvements in ADCs have 
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revolutionized this technology. Furthermore, these substantially greater capabilities 
are necessary for practical and affordable implementation of the multichannel systems 
that are the primary focus of this book. 

Two ADC characteristics dominate selection discussions for HF receiver applica¬ 
tion, speed and dynamic range. Thirty years ago, when adaptive array processing of 
HF signals was first demonstrated, ADC technology was hard-pressed to provide a 
14-bit device with a sample rate of 2 Msps. Today, several 16-bit ADC devices are 
available with sample rates exceeding 100 Msps. Furthermore, the raw analog and dig¬ 
ital dynamic range is about 100 dB. Such dynamic range characteristics are necessary if 
both a strong in-band signal and a much weaker signal are to reach the ADC simulta¬ 
neously yet be separated using a combination of spectral, spatial, and temporal digital 
processing. The second half of this book addresses digital signal-processing approaches 
that can accomplish such separation and allow each signal in the frequency cover¬ 
age of the receiver with sufficient differences to be processed separately. Quantitative 
requirements for signal separation will be identified. 

As ADC technology appropriate for the HF receiver application became available, 
it was necessary to develop suitable testing methods, to standardize these, and to apply 
them uniformly to those candidate devices that seemed to meet the dynamic-range 
specification. Comparisons of candidates exercised under identical circumstances were, 
and still are, necessary. To facilitate this testing, a formal test bed was developed at Lin¬ 
coln Laboratory in the mid-1980s [1, 2, 3] and used for such standardized tests at that 
time; such testing continues to this day. Fortunately, many ADC vendors recognized 
the need for such standardized testing and as a consequence, developed their own 
test facilities capable of conducting the standardized tests. Furthermore, these vendors 
began to publish their test results in standard forms, allowing potential users to make 
meaningful comparisons among competing devices. This entire process has changed 
the way that ADCs are evaluated and chosen for a specific application. Figure 4.9 is an 
example of the spurious-free dynamic range (SFDR) test results that are now standard. 

Although this book is not intended to provide a detailed picture of ADC evaluation, 
it is interesting to review some of-the recommended test procedures and to illustrate 
these procedures with results on ADCs that are deemed appropriate for HF receiver 
application. It should be emphasized that any comparisons are neither endorsements 
nor criticism of any specific device. In fact, by the time the reader sees these exam¬ 
ples, newer and more capable devices will be on the market. The test procedures to 
be emphasized here consider two important ADC characteristics, residual error and 
spurious-free dynamic range. These characteristics focus on accuracy and linearity, 
both of which relate to dynamic range; the second important feature, speed, is implicit. 
As long as the device is tested at the intended application speed and at that speed meets 
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Figure 4.9 Example SFDR test results 


the dynamic range requirements with representative signals, the user should find the 
device satisfactory. 

4.4.1 Noise 

There are several sources of distortion in an ADC: quantizer errors, amplitude gain 
compression, and sample-time jitter. Each will be discussed briefly, starting with quan¬ 
tizer errors. Even a perfect ADC will not produce an error-free replica of the input 
signal due to the approximation made by quantizing the input signal. Figure 4.10 
illustrates the errors generated by the process of ADC quantization. This error 
can be assumed to be uniformly distributed across a quantization step. Under this 
assumption, the mean of the error will be zero and its variance will be 

a 2 = V^/12, (4.12) 

where Vq is the quantizing interval associated with the least significant bit of the ADC; 
that is, the quantization step. This idealized quantization provides a reference against 
which an ADCs actual error can be evaluated. For a full-scale sinusoid as input, with 
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x 'jQ -3 Digitizing error for cos 2 Hz wave: 8-bit sampling at 1 kHz 



Figure 4.10 Example of signal representation errors due to quantization 


a peak-to-peak amplitude of V r volts, the input signal power is given by 

P FS = {V r /2) 2 /2Z, (4.13) 

■ -• K 

where Z is the input impedance. The quantization noise power associated with the 
ADC, Pq , is equal to the variance given in equation (4.12), so that Pq — <J 2 • 

Such measurement of the actual error can most conveniently be done using a sine 
wave input. If output ADC samples are examined, differences between them and 
the input sine wave can be used to determine how closely the actual error variance 
compares to the idealized variance identified in equation (4.12). This comparison 

provides the basis for a “bits lost” characterization, 

b L = log 2 ( V q(eff l V q ) = log 2 (Vna eff / V q ) . (4.14) 

From this equation it can be seen that doubling the effective quantization interval 
results in one lost bit. 
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The power relationships in equations (4.12) and (4.13) lead to a well-known expres¬ 
sion for the ADC signal-to-noise ratio. If one assumes that performance is only limited 
by the noise due to quantization (and not by distortion) and an N-bit ADC is used, 
then the signal-to-noise-plus-distortion, commonly known as SINAD , is given (in 
decibels) by 


SINAD =10 log 10 [PFs/Pq\ = 10 log 10 


l(V q • 2 n )/2?/2R 
F//12Z 


= 6.027V +1.76. 

(4.15) 


This expression is often used to determine the number of ADC bits needed to achieve 
a desired dynamic range. 


4.4.2 Distortion 


When the circuitry used to precondition a signal for presentation to the ADC exhibits 
nonlinearities, the result is distortion that compromises overall receiver performance. 
Figure 4.11 illustrates how the transfer function of this circuitry might introduce dis¬ 
tortion in the signal presented to the sampling switch, thereby producing distortion in 
the digitized signal. The ideal transfer function is linear; that is, the output is directly 
proportional to the input, and the output manifests a straight-line relationship to the 
input. But the aforementioned linearities may modify this relationship, and the actual 



Figure 4.11 Nonlinearities and resulting distortion 
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Figure 4.12 Simplified ADC block diagram 

input-output relationship will deviate from a straight line. As shown in the diagram, 
the result is undesired distortion in the output. Efforts to mitigate this distortion are 
discussed briefly next. 

Distortion can be the result of amplitude gain nonlinearities or sample-time jit¬ 
ter. To better understand these distortion components, consider the simplified block 
diagram of a typical ADC shown in figure 4.12. The analog input is conditioned by 
an operational amplifier and then presented to a sampling switch. This switch closes 
briefly at the proper sampling time, allowing the holding capacitor to charge. After a 
suitable charging time, the switch opens and a quantizer converts the voltage on the 
holding capacitor to a digital word. This process is repeated periodically at the desired 
sampling rate. Although detailed discussion of this circuitry is beyond the scope of 
this book, it is noteworthy that the sampling switch is a transistor in which the signal 
itself can modulate the on-off transition voltage, leading to distortion in the sampled 
data stream. 

Amplitude nonlinearities can also be introduced by the input low-noise amplifier 
or by the switch that is in series with the holding capacitor. If the capacitive impedance 
is high; that is, the holding capacitance is small (Z c = 1/coC is large), the voltage com¬ 
ponent across the switch will be dominated by the voltage across the capacitor, and 
the quantizer will follow rapid variations in a wideband signal. This design approach 
is in conflict with the need to maintain a constant voltage on the capacitor during 
the hold-and-digitize state and presents a real challenge to the ADC development 
community. 

Another important distortion issue concerns the negative-to-positive transition in 
the ADC transfer function. This corresponds to code transitions associated with input 
levels near zero and is described as crossover distortion . Midway through a digitizers 
range, the output code transitions from the code Oil... Ill to the next higher level 
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corresponding to the code 100.. .000, at which point all code bits must switch simul¬ 
taneously. If this is not implemented perfectly, distortion in the output stream will be 
apparent, and SFDR performance will suffer. 

4.4.3 Sample-Time Jitter 

Sample-time jitter is another limiting effect in ADC design and application. Consider 
a sine wave input to an ADC such that the k-th sample occurs at time kx. If kx is in 
error by 5, this sample can be represented by the expression 

Sfc — Acos[co(kx -f- 8) T 0] -f (4.16) 

which can be expanded to produce 

[Acos(ookx + 0) + n(k r)] — co8Asin(cokx + 0). (4.17) 

The bracketed term is the desired sample value while the term proportional to 8 
represents the error due to sample-time jitter. Note that the error component is in 
quadrature with the desired sample. The goal in selecting a timing source should be to 
keep this timing jitter error less than half the quantization level, or 

(2nf8A) max < 0.5Q. (4.18) 

If the amplitude of the signal is conditioned such that its peak-to-peak value equals 
the range of the ADC, then A max = 0.5 V ry and V r = Q2 72 , where n is the number of 
ADC bits. In this case, 

2 7if 8 < 2~ n . (4.19) 

This equation establishes the number of bits needed to resolve the quadrature effects 
of sample-time jitter. It can be used to characterize ADCs by the technology level used 
in their manufacture. 

4.4.4 Spurious-Free Dynamic Range 

One of the primary shortcomings of ADCs is a limitation on dynamic range. If the 
signal presented to the ADC is strong, clipping or other nonlinearities in the device will 
cause spurious responses to be generated. When the input is a sinusoid, these spurious 
responses appear as sinusoids at harmonics of the input frequency. Although the level 
of these harmonics is usually low, it can be high enough to interfere with reception 
of other signals within the receiver-operating bandwidth. This ADC characteristic has 
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led to ADC tests with full-range sinusoids as inputs and spectral processing of the 
ADC outputs to make the low-level spurious responses observable. The spurious-free 
dynamic range (SFDR) of an ADC is the ratio of the input sinusoid power to the 
power in the highest spectral response at a frequency away from that of the input. As 
mentioned earlier, manufacturers conduct such SFDR measurements and present the 
results as part of the dynamic performance specification. Devices can exhibit SFDR 
levels in excess of 100 dB. Section 4.4.6 will review the progress that has been made to 
improve SFDR in recent years. 

When implementing an HF receiver with a digital output, care must be taken in 
conditioning the RF signal for presentation to the ADC. Two factors are most impor¬ 
tant. First, because the ADC device is single ended, the RF signal must be DC-biased 
to place a zero RF input at the midpoint of the ADC range. Second, it is undesirable 
to adjust the input signal level to the point where a large positive input drives the 
ADC to full scale, or a large negative input induces a negative out-of-range condition. 
Such strong input levels in the receiver gain plan would leave no headroom in the event 
that an unexpectedly strong signal, noise spike, or unfortunate combination of multi¬ 
ple strong signals impinges on the antenna. As noted earlier, an input voltage greater 
than the digitizer full-scale value will generate a particularly unfortunate sample value: 
as a positive input voltage increases toward this value, the first bit of the output sample 
will be a 0, reflecting a positive sample, and the remaining bits will increase as a binary 
count toward all Is. At the point where the output sample is Oil... 11, the maximum 
possible positive digitizer output, an incremental increase in input value will increase 
the output sample to 100...00, the most negative possible output. An equivalent sit¬ 
uation can arise with large negative inputs to the ADC. Clearly, these are undesirable 
situations, and they receive considerable attention from the ADC designer. 

Overranging code transitions are not the only issue with overdriving an ADC. 
It is also important to avoid those portions of the ADC transfer characteristic that 
induce spurious responses introduced by sample-and-hold circuit nonlinearities. Of 
course, these nonlinearities should not be a major factor in ADC selection. Instead, the 
receiver designer must consider where non-linearities are located on the transfer char¬ 
acteristic. The result of these considerations suggests that the signal be conditioned to a 
level well below full scale. This means that a significant amount of the ADCs dynamic 
range is typically unused but provided to allow for an unfortunate combination of 
inputs. 1 Note also that ADC nonlinearities can vary depending on signal dynamics; as 


1 Gaussian noise, an inportant component of the ADC input, has no statistical maximum value, but in practice a 
13 dB allowance for statistical noise variation is sufficient. 
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a result, the dynamic ADC performance specifications, signal-to-noise-plus-distortion 
(SINAD), and SFDR capture these effects more effectively than corresponding static 
specifications and should be used in ADC selection and preconditioning design. Such 
dynamic effects motivate testing with sinusoidal signals at frequencies representative 
of the application. 

As an example of a receiver gain plan, assume an ADC has an SFDR of 90 dB and 
a full-scale input level of +12 dBm (2.5v p-p into 50 ohms). The spurious level for 
such a full-scale input would be at —78 dBm (+12 — 90) at the ADC input, but if the 
receiver gain plan were based on keeping the strongest expected signal at 13 dB below 
full scale, the spurious level of concern would be reduced to —91 dBm. Potentially, 
such a spurious reponse could be mistaken for an actual antenna signal. If the receiver 
signal conditioning included 20 dB of signal gain, this would correspond to an antenna 
signal at —111 dBm. This spurious level can be compared with receiver internal thermal 
noise at the antenna, which for a bandwidth of 1 kHz is approximately —124 dBm. 2 In 
an environment with a strong signal equal to the design maximum, if internal receiver 
thermal noise was the only noise competing with signal reception, such a spurious 
response would become the limitation on receiver sensitivity. However, at HF, external 
noise can be substantially greater than thermal noise. In this example, a 10 dB increase 
in noise level due to external noise would result in a signal-to-noise ratio of only 3 dB 
on a spurious level signal, reducing concern over mistaking a spurious response for a 
real signal. Thus, the example design would appear to have a reasonable gain plan in 
which both the external noise environment and the spurious response characteristics 
of the ADC are balanced. 

Figure 4.13 plots the signal level at the ADC input as a function of antenna sig¬ 
nal level with the receiver gain as a parameter. The plot also includes horizontal lines 
showing the spurious level associated with SFDRs of 80, 90, and 100 dB. The asterisk 
indicates the situation discussed in the previous example. Figure 4.14 shows the signal- 
to-noise ratio associated with an antenna signal level for three levels of external noise 
above kTBF noise. From this plot and the one in figure 4.13, one can determine the 
sensitivity of the receiver. In the example, figure 4.14 shows that with 10 dB of external 
noise, a 10 dB SNR is achieved at —104 dBm. Reference to figure 4.13 suggests that 
with a conditioning gain of 20 dB and an ADC having an SFDR of 90 dB, there would 
be no issue operating at this level. If the ADC SFDR were 80 dB, spurious signals with 
an SNR above 10 dB could occur. 


2 kTBF = ~228.6(/c) + 24.6(7" = 290 deg) + 30(8) + 20(F) 
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SFDR/receiver gain analysis 



Antenna signal level (dBm) 

Figure 4.13 Signal level at ADC input—receiver gain variable 
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Figure 4.14 Signal-to-noise ratio vs. input signal level—external noise variable 
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This example shows why careful study of ADC manufacturer specifications is 
important when developing an appropriate gain plan using a particular ADC. The 
reader should take advantage of numerous application notes published by these 
organizations when planning the design of a modern digital receiver for use in the 
HF band. 

4.4.5 Dither 

Even a superficial examination of the digitizing process makes it apparent that inac¬ 
curate quantizer steps become a leading source of ADC performance shortcomings. 
Consider an input signal that spans several quantizer levels, some of which are not 
precise multiples of the quantizer step size, Q. If this input is a sinusoid, these inaccu¬ 
rate steps will be revisited with every cycle of the input sinusoid, leading to a distorted 
output waveform that contains many harmonics of the input signal frequency. These 
harmonics become the spurious responses discussed earlier. They become spectrally 
prominent and spectrally discrete due to the repetitiveness of the digitizing error. 
Dither is a commonly used approach to mitigating such repeating errors; it involves 
the addition of a random noise component to the input signal, thereby randomiz¬ 
ing the repetitive errors, spreading the energy in frequency, and reducing the strength 
of the discrete frequency spurious responses. 

Of course, an additive dither signal adds noise to the output and therefore com¬ 
promises the signal-to-noise ratio on the desired signal. Two techniques can be used 
to avoid this problem. The first of these is fairly simple: utilize a dither signal that is 
noise-like but has a spectrum that does not overlap the signal spectrum. The output 
signal can then be processed to filter out the undesired dither-related noise. Often, the 
quantized output would be spectrally processed even if there were no dither signal; in 
such cases, the dither signal can be filtered without significant overall processing cost. 
Care should be taken in selecting the dither signal. If its spectrum is dominated by 
much lower frequencies compared to those in the desired signal, the intended reduc¬ 
tion in repetitiveness will be compromised; if too high, the sampling rate of the ADC 
may be inadequate to provide the desired level of spectral resolution. Such a spectrally 
separate dither signal is therefore best used when the desired input-signal has gaps in 
its spectrum, and a dither signal is designed to fall in these gaps. 

The second alternative to mitigating dither-based signal-to-noise loss requires that 
the dither signal have a known but noise-like structure; that is, it is not truly random 
but is computable even though it has a noise-like amplitude and frequency variation. 
Such a dither signal has the desired effect of reducing the repetitive nature of the 
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Table 4.1 Historical picture of analog-to-digital converters for HF receiver applications 


Manu. 

Part # 

Date 

Bits 

Sample Rate 
(Msps) 

SINAD(dB) , 

SFDR(dB) 

AD 

1205 

1986 

12 

5 

61.4 

63 

BB 

ADC600 

1986 

12 

10 

65.6 

68 

AD 

CAV1220 

1986 

12 

20 

55.3 

54 

WE 

WEC1405 

1986 

14 

5 

67.8 

70 

El 

AD1512 

1986 

14 

5 

66.2 

74 

AD 

AD9014 

1990 

14 

10 

75.0 

72-84 

AD 

AD9042 

1996 

12 

41 

67.5 

80 

CO 

CLC949 

1996 

12 

20 

68.0 

73 

CO 

CLC935 

1996 

12 

15 

65.4 

81 

MX 

MAX1427 

1998 

15 

80 

73-76 

91-93 

AD 

AD6645 

2003 

14 

80 

75.0 

93 

TI 

ADS5422 

2005 

14 

65 

71.0 

85 

LT 

LT2009 

2007 

16 

160 

77.1 

105 

AD 

AD9467 

2013 

16 

160 

74 

100 

LT 

LTC2107 

2014 

16 

210 

80 

98 


digitizing errors but also to the first order has a predictable, and therefore remov¬ 
able, effect on the digitized output. The processing to remove the dither-induced 
error is similar to adaptive noise-cancellation schemes used in many modern sys¬ 
tems. In this case, adaptation is only necessary to match the noise-like signal being 
subtracted. 

Dither has become such a common component of the digitizing process that today 
ADC manufacturers provide device specifications with and without dither. Some even 
include dither generation and addition as a pin-selectable ADC feature. Usually, there 
is a significant improvement in the SFDR when dither is implemented or, in the case 
of devices with a built-in dither capability, turned on. 

When modern HF receivers are used to process more than a few hundred kilohertz 
of bandwidth, then the noise-like nature of the HF band introduces its own dither. It 
is not often that a single, or even a small number of HF signals, dominate the power 
received at the receiver input. As a result, the input is effectively Gaussian in nature, 
and the HF receiver can be described as “self-dithered.” 
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4.4.6 Recent ADC Progress 

In the time since interest in digital receivers for HF applications began in the mid- 
1980s, ADC technology has moved forward substantially. Whereas devices able to 
provide accurate 12-bit sampling at a 10 Msps rate or 14-bit sampling at 5 Msps repre¬ 
sented the state of the art for covering an acceptably large portion of the HF spectrum 
in that era, 14-bit devices able to sample the entire HF spectrum became available 
about 2003. These devices inspired new designs for HF receiver applications utilizing 
the digital receiver architectures described earlier in this chapter. Table 4.1 lists a num¬ 
ber of important ADC devices appropriate for HF receiver design in order of date of 
introduction. It is apparent that great progress has been made with ADC development 
and that today a digital HF receiver, which covers the entire band, can be built at an 
acceptable cost. 
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HF Array-Processing Architecture 


Previous chapters provided background material on the propagation of HF waves, 
common HF modulation techniques, and RF receiver architectural options. Now this 
book begins its primary topic, digital signal-processing techniques that exploit the 
performance enhancement opportunities afforded by a multielement antenna array. As 
indicated in chapter 1, such techniques provide significant advantages: 1) increased gain 
for greater receiver sensitivity, 2) enhanced cochannel interference-suppression capa¬ 
bility, and 3) opportunities for the coherent combining of ionospheric propagation 
modes to suppress multimode garble and realize greater signal-to-noise ratio. These 
techniques will be detailed in the chapters to follow. This chapter outlines the overall 
signal-processing architecture, with hints as to how the multielement array provides 
the leverage for performance enhancements. Subsequent chapters will delve deeply 
into signal-processing algorithms that provide the desired performance enhancements 
and quantitatively assess these benefits. 

Figure 5.1 shows the logical relationship among the key functions in an overall HF 
sensing-system block diagram. 

5.1 Detection 

Given a system input, detection can be defined as the first branch of a binary deci¬ 
sion: 1) Is a signal present, or 2) Is the current system input entirely attributable to 
background noise? Many detailed analyses have been performed to address this ques¬ 
tion as informed by basic probability theory. The question at hand could be restated: 
Is the input to a receiver a signal at a specific level, or is it just (Gaussian-distributed) 
noise obscuring the observation? Furthermore, the decision could be much more com¬ 
plex. When signal and interference are potentially present simultaneously and the 
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Figure 5.1 Overall HF sensing-system block diagram 
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environment statistics are not well known, as is common at HF, models to support 
a well-defined decision process become highly problematic. 

In the HF signaling environment, ionospheric propagation effects add substantial 
uncertainty to what the signal characteristics might be when reaching the receiver. 
Both channel noise and cochannel interference make the alternative to a signal-present 
decision difficult to model quantitatively as guidance to formulating a decision rule. 
When the receiving system is equipped with a multielement array having sufficient 
aperture, it can measure the spatial distribution of arriving energy and determine if 
the current distribution is different from the norm. If a spatial difference between 
signal and interference is apparent to the antenna array, the signal-present and signal- 
absent statistics will show a greater disparity, thereby providing a clearer rationale for 
declaring a detection. Such a process might entail making periodic energy distribution 
estimates, followed by a methodology for comparing the signal-present and signal- 
absent distributions resulting from this estimation process. When the distributions 
are sufficiently different, a new signal arrival might be declared. 

The general formulation of such decision making often uses a Bayes criterion, which 
leads to a likelihood ratio approach. The probability of observing the collected data 
under the hypothesis that the signal is present and the probability of observing the col¬ 
lected data under the alternate hypothesis that the signal is absent are both computed. 
Then, their ratio is compared to a threshold. Different a priori probabilities for the 
signal-present and signal-absent events, as well as for the costs of the various errors, 
can be used to adjust the threshold. If the cost differences are ignored, it is common 
to make the comparison 

Prob{z \, Z 2 , ... | signal present) ^ Prob(signal present) 

Prob(z \, Z 2 , ... |signal absent) > Prob(signal absent) 

When the left-hand side exceeds the expected a priori value for the ratio, a decision 
can be made in favor of the signal being present. 

As discussed in most texts covering decision making (e.g., [3]), there are several 
other approaches in addition to this Bayes likelihood ratio test technique. The Bayes 
technique can be extended to a minimax approach that chooses the a priori probabil¬ 
ities to make the decision process as beneficial as possible. Alternatively, a Neyman- 
Pearson criterion, which maximizes the probability of detection for a fixed false-alarm 
probability, can be utilized. This criterion also leads to a likelihood ratio test. 

Chapter 7 is devoted to detection. It is logically the first step in the overall pro¬ 
cess leading to the declaration of a signal of interest and consequent signal reports 



120 Chapter 5 


that contain detailed information about the signal and its direction of arrival. It also 
provides a useful vehicle for initial discussion of the signal-processing methodologies 
and benefits associated with antenna arrays. 

5.2 Direction Finding 

Direction finding (DF) is a process that produces a direction-of-arrival estimate for an 
incoming signal. In the HF signal-processing environment, DF methodology exploits 
a fundamental characteristic of an electromagnetic wave: whenever the observer is suf¬ 
ficiently far from a source, propagating electromagnetic signals travel as plane waves. 
This characteristic holds even for HF signals that have been refracted in their pas¬ 
sage through the ionosphere and are consequently bent downward toward Earth, 
where they might encounter a receiving antenna array. In the 1980s, this observation 
was not universally accepted; however, in the intervening years, extensive experi¬ 
ence with antenna arrays has provided sufficient evidence that such a model is accurate. 
The next chapter elaborates on how this plane wave model can be leveraged by array 
processing algorithms to suppress the effects of interfering signals and provide accurate 
line-of-bearing (LOB) estimates limited primarily by the size of the array aperture. 

Chapter 8 will quantitatively address the influence of sensor antenna aperture, 
observation time, signal frequency, and signal strength upon DF estimate accuracy. 
For the present, it will be useful to briefly discuss the so-called Rayleigh criterion , which 
describes the ability of an aperture, normalized to wavelength, to resolve two targets 
by their differences in direction. In the context of an RF receiving array, this process 
is equivalent to producing DOA or LOB estimates for each target. Consider a vector 
from the sensor phase center to the signal source and a plane perpendicular to this 
vector through the sensor phase center, as illustrated in figure 5.2. The effective aper¬ 
ture for estimating the LOB is the projection of the physical aperture onto this plane. 
If the aperture is linear, this will be a line; if two-dimensional, this projection will be 
planar. The Rayleigh criterion provides an estimate of the beamwidth associated with 
an aperture of a given diameter and therefore provides an estimate of the accuracy with 
which the angle between the vector and the plane can be determined. The Rayleigh 
criterion can be stated as: 

a e =l.22X/D, (5.2) 

where X is the wavelength associated with the signal carrier, and D is the diameter 
of the effective aperture; <jq is measured in radians. Thus, if the effective aperture 
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Figure 5.2 Line-of-bearing estimation geometry 
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diameter is 90 m and the signal frequency is 20 MHz; that is, the wavelength is 15 m, 
oq = 0.2 rad, or approximately 12°. If the goal is to achieve an LOB error of 3°, it 
will be necessary split the beamwidth by a factor of 4. This is feasible if the signal- 
to-noise ratio is sufficiently high. This beam-splitting process is discussed further in 
chapter 9. 

The parameter D requires some interpretation. As shown in figure 5.2a, if the 
aperture is a linear array, then D is the length of the array. If the aperture is planar, 
figure 5.2b suggests that D should be measured in the direction associated with the 
desired angular estimate. In the simple situation where the signal source is on the array 
perpendicular and the aperture is circular, then D is the aperture diameter, as expected. 
In less simplistic situations, the geometry of the array, the direction of the signal, and 
the angular measurement coordinate system orientation must all be accounted for in 
applying the Rayleigh limit. 

It is important to consider the HF propagation environment when interpreting 
these DOA estimation limits. Typically, an HF signal reaches a receiving antenna array 
after refraction by the ionosphere, arriving with an elevation angle typically between 
5° and 45°. For example, an array designed for sub-5° accuracy would require an 
aperture of eleven wavelengths; if this is to be achieved on a 10 MHz signal, the required 
projected aperture would be 330 m in diameter. From a practical standpoint, such an 
array must be horizontal. But if the goal is to estimate signal elevation angle, the 
projection of this aperture onto the plane described is then reduced by the sine of this 
angle, a severe penalty upon the measurement aperture, thereby reducing accuracy. 
The penalty associated with such an elevation angle estimation process is known by the 
term geometric dilution of precision because it results from the geometric relationship 
between the signal vector and the array aperture. Fortunately, there is no geometric 
penalty on a horizontal aperture’s ability to measure azimuth. 

5.3 Copy 

The term copy is a holdover from the early days of radio, when an operator would ask 
for confirmation of message reception with the query, “Do you copy?” In the modern 
digital world, the copy process is merely estimating, at the sensor, what was trans¬ 
mitted. Thus, one can alternatively use the term signal estimation when referring to 
“copy.” With a wideband array of multiple antenna elements, antenna gain and inter¬ 
ference suppression can be achieved by: 1) weighted combinations of array element 
signals, 2) tapped delay-line equalization, and 3) combinations of these techniques. 
The array element weights and/or the tapped delay-line weights can be adaptively 
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adjusted in response to the environment. Chapters 11 and 12 focus on adaptive forms 
of signal-estimation processing that utilize these three approaches. 

Although this definition of copy processing is simple enough, there are important 
issues when one desires to evaluate the effectiveness of a copy algorithm using an adap¬ 
tive array receiver at HF. First, there is channel uncertainty. A specific technique can 
be evaluated for a well-defined channel, such as the additive, white Gaussian noise 
channel; the performance of many demodulation techniques have been assessed for 
this channel [2, 3, 4]. However, at HF the channel model must account for iono¬ 
spheric variabiliity, which is dependent upon solar illumination, Earth magnetic fields, 
traveling ionospheric disturbances, and so on, as described in chapter 2. Under such 
time-varying conditions, demodulator performance is not easily modeled or analyzed. 
Some demodulation techniques have been modeled for a Rayleigh fading channel [6] 
and others for a Rician channel that manifests a specular component in addition to 
the Rayleigh fading [7]. These models can be used to characterize symbol errors when 
the channel statistics are well defined, but at HF any model parameters would vary 
slowly with time of day, and it would be necessary to conduct evaluations of copy 
performance for a range of these parameters. 

Second, an adaptive array-processing algorithm, intended to increase signal-to- 
noise ratio and/or suppress interference, is either in series with the demodulation 
processor or coupled to it in a joint array-processing/signal demodulation algorithm. 
Detailed models for the performance of coupled algorithms in time-varying channels 
are not generally available. 

Third, the operating scenario is important. Empirical assessments of demodulation 
performance have been carried out with known transmitted signals; that is, when the 
communications channel is cooperative. Some results of these experiments have been 
published [5, 8] addressing the relative reliability of various modulation techniques 
under some channel conditions. Equivalent empirical assessments for the noncoop¬ 
erative channel, in which the signals are unknown, are difficult because the receiver 
does not have a detailed definition of “truth.” Some notion of the expected copy per¬ 
formance can be estimated by: 1) characterizing the signal-to-interference-plus-noise 
ratio for the signal arriving at the antenna aperture, 2) estimating the level of interfer¬ 
ence suppression provided by the array processing, and 3) inferring the expected bit 
error rate obtained for the modulation method used by the transmitter using compa¬ 
rable empirical results for cooperative channels. Any performance estimates made in 
this way can be supported by measuring error rates on those channel symbols that are 
fixed in the transmissions; these would include call signs and training sequences used 
for equalization. 
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5.4 Reconstruction 

Reconstruction is an extension of the copy process. It is employed when the inter¬ 
ference environment varies across the bandwidth of the signal of interest. In such 
situations, the bandwidth occupied by the signal is divided into subbands, allowing 
different copy processing to be utilized in each subband. The copy process for each 
subband will provide a clean replica of each transmitted signal subband, and these 
subbands will require coherent combining to. obtain a complete estimate of the trans¬ 
mitted signal. Reconstruction is the term given to such recombining; it will be covered 
in more detail at the end of chapter 11. 

The motivation for reconstruction lies with environments that are dense with inter¬ 
fering signals. As will be detailed in chapter 6, an array with N antenna elements has 
N — 1 degrees of freedom; that is, the ability to place interference suppressing nulls on 
N — 1 interferers. If there are so many interferers that the signal of interest is obscured, 
there may be more than N — 1 interferers in the bandwidth occupied by the desired 
signal. If the composite signal at each antenna is processed into subbands, each cov¬ 
ering a portion of the signal of interest, then it is likely that fewer interferers will 
lie in each subband than in the total bandwidth. The number of subbands can be 
increased until no subband includes more than N — 1 interferers. As a result, the array 
will have sufficient degrees of freedom to handle this dense interference environment. 
Thus, reconstruction expands the power of the N-element array to handle more than 
N — 1 interferers. The number it can now handle will depend on the interferers 5 indi¬ 
vidual bandwidths and how the interferers are distributed across the signal-of-interest 
bandwidth. 

This increase in array capability brings five important challenges: 

Spectral analysis. First, the array signals must be divided into subbands, requiring 
a spectral analysis processing step that might otherwise be unnecessary. The form 
of this spectral-analysis processing may have to adapt to the density of interfering 
signals, requiring an environment assessment as a preliminary step. 

Multi subband array processing. Second, the array-based interference-suppression 
processing must be carried out separately in each subband. This means that a set 
of antenna element weights must be determined for each subbiand and the weights 
applied in each of them. This processing stage, and the spectral analysis preceding 
it, may bring on substantial additional processing costs. 

Subband combining. Third, the results of the interference-suppression processing in 
each subband are only a portion of the signal of interest living in that subband. For 
most signals, this will require that the subband results will have to be combined to 



HF Array-Processing Architecture 125 


provide a final interference-free, full bandwidth signal of interest. Techniques for 
this recombination processing, herein denoted reconstruction, are a research topic 
that will be introduced in more detail in chapter 11. 

Subband antenna weight determination. As will be seen throughout chapters 7 
through 12, some of the most effective interference-suppressing techniques exploit 
specific features of the signal of interest. Constant envelope processing and cyclo¬ 
stationary processing are two examples of these techniques. Once spectral-analysis 
processing has been introduced, breaking the signal of interest into subbands, the 
signal features are no longer available to exploit. More generic interference suppres¬ 
sion array-processing techniques must be used. Since these may not be as effective 
as the signal exploitation methods, there is a conundrum: reducing the number 
of interferers by means of subbands eliminates the most effective array-processing 
techniques for suppressing multiple interferers, requiring more subbands with fewer 
interferers in each, hence more processing altogether. When the interference density 
is very high, however, the number of interferers in the signal-of-interest bandwidth 
may exceed the array degrees of freedom, necessitating the use of subbands and 
living with nonexploitive and less effective array-processing techniques. 

Subbanding strategy determination. An efficient subbanding strategy requires a 
process of interference environment assessment that evaluates the disposition of 
signals of interest and interferers to provide a cost-effective strategy for array process¬ 
ing. If the environment is not dense, some form of this assessment will still be neces¬ 
sary, although it will not be as difficult as if the environment were crowded. Current 
array processors do not fully address how this environment-assessment processing 
should be carried out; developments in this arena are also a current research topic. 

5.5 Signal Sorting 


The abundance of signals in the HF band leads to a requirement for signal sorting. 
Once a signal has been detected, the sensor will be able to characterize the detected 
signal in various ways. But it is clearly inappropriate to expend significant signal¬ 
processing resources on signals that are of no interest. Characterization, classification, 
and recognition arc terms associated with building; a hierarchical understanding of all 
signals in the environment without spending excessive resources bv probing everv sig¬ 
nal to the fullest extent. Hierarchical sorting refers to a multilevel process of signal 
sorting, similar to searching a library for a specific book. 1 he first level sort is the basic 
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the final level might be title. In the sig 
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5.4 Reconstruction 

Reconstruction is an extension of the copy process. It is employed when the inter¬ 
ference environment varies across the bandwidth of the signal of interest. In such 
situations, the bandwidth occupied by the signal is divided into subbands, allowing 
different copy processing to be utilized in each subband. The copy process for each 
subband will provide a clean replica of each transmitted signal subband, and these 
subbands will require coherent combining to obtain a complete estimate of the trans¬ 
mitted signal. Reconstruction is the term given to such recombining; it will be covered 
in more detail at the end of chapter 11. 

The motivation for reconstruction lies with environments that are dense with inter¬ 
fering signals. As will be detailed in chapter 6, an array with N antenna elements has 
N — 1 degrees of freedom; that is, the ability to place interference suppressing nulls on 
N — 1 interferers. If there are so many interferers that the signal of interest is obscured, 
there may be more than N — 1 interferers in the bandwidth occupied by the desired 
signal. If the composite signal at each antenna is processed into subbands, each cov¬ 
ering a portion of the signal of interest, then it is likely that fewer interferers will 
lie in each subband than in the total bandwidth. The number of subbands can be 
increased until no subband includes more than N — 1 interferers. As a result, the array 
will have sufficient degrees of freedom to handle this dense interference environment. 
Thus, reconstruction expands the power of the N-element array to handle more than 
N — 1 interferers. The number it can now handle will depend on the interferers 5 indi¬ 
vidual bandwidths and how the interferers are distributed across the signal-of-interest 
bandwidth. 

This increase in array capability brings five important challenges: 

Spectral analysis. First, the array signals must be divided into subbands, requiring 
a spectral analysis processing step that might otherwise be unnecessary. The form 
of this spectral-analysis processing may have to adapt to the density of interfering 
signals, requiring an environment assessment as a preliminary step. 

Multi subband array processing. Second, the array-based interference-suppression 
processing must be carried out separately in each subband. This means that a set 
of antenna element weights must be determined for each subband and the weights 
applied in each of them. This processing stage, and the spectral analysis preceding 
it, may bring on substantial additional processing costs. 

Sub band combining. Third, the results of the interference-suppression processing in 
each sub band are only a portion of the signal of interest living in that sub band. For 
most signals, this will require that the subband results will have to be combined to 
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provide a final interference-free, full bandwidth signal of interest. Techniques for 
this recombination processing, herein denoted reconstruction, are a research topic 
that will be introduced in more detail in chapter 11. 

Subband antenna weight determination. As will be seen throughout chapters 7 
through 12, some of the most effective interference-suppressing techniques exploit 
specific features of the signal of interest. Constant envelope processing and cyclo¬ 
stationary processing are two examples of these techniques. Once spectral-analysis 
processing has been introduced, breaking the signal of interest into subbands, the 
signal features are no longer available to exploit. More generic interference suppres¬ 
sion array-processing techniques must be used. Since these may not be as effective 
as the signal exploitation methods, there is a conundrum: reducing the number 
of interferers by means of subbands eliminates the most effective array-processing 
techniques for suppressing multiple interferers, requiring more subbands with fewer 
interferers in each, hence more processing altogether. When the interference density 
is very high, however, the number of interferers in the signal-of-interest bandwidth 
may exceed the array degrees of freedom, necessitating the use of subbands and 
living with nonexploitive and less effective array-processing techniques. 
Subbanding strategy determination. An efficient subbanding strategy requires a 
process of interference environment assessment that evaluates the disposition of 
signals of interest and interferers to provide a cost-effective strategy for array process¬ 
ing. If the environment is not dense, some form of this assessment will still be neces¬ 
sary, although it will not be as difficult as if the environment were crowded. Current 
array processors do not fully address how this environment-assessment processing 
should be carried out; developments in this arena are also a current research topic. 

5.5 Signal Sorting 

The abundance of signals in the HF band leads to a requirement for signal sorting. 
Once a signal has been detected, the sensor will be able to characterize the detected 
signal in various ways. But it is clearly inappropriate to expend significant signal¬ 
processing resources on signals that are of no interest. Characterization, classification, 
and recognition are terms associated with building a hierarchical understanding of all 
signals in the environment without spending excessive resources by probing every sig¬ 
nal to the fullest extent. Hierarchical sorting refers to a multilevel process of signal 
sorting, similar to searching a library for a specific book. The first level sort is the basic 
book category: fiction, poetry, history, and so on; the second level might be author; and 
the final level might be title. In the signal-sorting world, characterization is the coarsest 
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level, based on simple measurements in the spectral and temporal domain: Are the sig¬ 
nal duration and bandwidth within a specified range? The term box detection has been 
used for this process. With array-based sensors, it is possible to include a coarse DOA 
estimate in this first-level sort criteria. Classification is the next level of sorting, still rely¬ 
ing on external characteristics of the signal, most often modulation type: single side¬ 
band (SSB), frequency-shift keyed (FSK), and so on. Recognition is the most detailed 
level of the sort, often involving demodulation of signal segments to allow readout of 
call signs, synchronization patterns, and specific internal signal characteristics. 

The critical notion regarding signal sorting is that it is a multistage process in which 
a series of characteristics are measured in turn, and as each measurement is completed, 
the question is asked, “Do the measurements results available so far suggest that the 
signal is of interest, or not of interest?” If the answer to the question is “not of interest,” 
no more measurements are made, and the signal is dropped from further consideration. 

As with any multistage sorting process, there is a trade-off between processing load 
and identification probability: more processing should lead to better performance. But 
the signal environment is so populous that the full range of processing is unsupportable 
if it is performed on every signal. With a hierarchical sort process, many uninteresting 
signals can be eliminated with moderate resource expenditure. There is the danger that 
a missed acceptance at one stage of the multistage process will result in a complete miss 
of an important signal. To combat this risk, the criteria used at the early stages should 
be applied more loosely than those at the end of the sorting process. 

To quantify this discussion, define P c f 0 as the probability of correct characteriza¬ 
tion, P c [ as the probability of correct classification, and P cr as the probability of correct 
recognition. Then, since all three events must be correct to achieve correct identifi¬ 
cation, the probability of correct identification, P ci d> is given by the product of these 
probabilities 

P c id = PchPclPcr • (5-3) 

An equivalent expression can be written in terms of the error probabilities. The 
probability of identification error, P etc [, is 

Pad = 1 " (1 " Peb ) (1 " Pel) (1 ~ Per ), (5.4) 

where P e j 0 is the probability of characterization error, P e i is the probability of classifi¬ 
cation error, and P er is the probability of recognition error. 

Even assuming that the detailed identification information is available from the 
demodulated data to permit perfect recognition, P cr = 1, the imperfections in the two 
previous stages have a significant effect on the end result. As demonstrated in figure 5.3, 
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Figure 5.3 Two-stage signal sorting error probability (perfect recognition assumed) 

a requirement for an identification error probability of 10 percent implies error prob¬ 
abilities of approximately 5 percent for each of the characterization and classification 
sorting stages. ' 

5.6 Geolocation 

Often, DOA or LOB estimates are intermediate contributors to a required estimate 
of a transmitter’s geographic location. But a single azimuthal bearing from a single 
sensor is not sufficient for this goal; it merely identifies a plane passing through the 
sensor on which the signal of interest lies. Geolocation addresses options for additional 
measurements. 

One option is to utilize a network of sensors, each able to make an independent 
azimuthal LOB estimate. The LOB estimates obtained from these sensors can then 
be combined in a data fusion process, leading to the desired estimate of transmitter 
location'. Typical HF transmitter targets are ground-based; that is, on the Earth’s sur¬ 
face; therefore, this process of combining multiple LOB estimates to produce a single 
transmitter location estimate is known as geolocation. Geolocation processing is the 
subject of chapter 10. 
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(b) Single-site location (azimuth, elevation, layer height 



(c) Two differential times of arrival -> two range differences 

Figure 5.4 Three geolocation estimation approaches 
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Another approach is known as single-site location processing, or SSL. Such process- 
ingutilizes not only the azimuthal LOB to identify the plane in which the transmitter 
lies; it also utilizes the elevation estimate of the arriving signal path, if available from the 
antenna array. As discussed in section 5.2, this elevation estimate is usually poor due to 
geometric dilution of precision and multimode interference. Perhaps the employment 
of a modern antenna array with appropriate processing methods will allow multimode 
ray structures to be resolved, thereby improving the elevation estimate substantially. In 
any case, with SSL, an elevation estimate is combined with estimates of the ionospheric 
layering occurring at the time of day when the two-dimensional LOB estimate was 
made. Ray-tracing methods are employed to backtrack the arriving ray to the appro¬ 
priate ionospheric reflection point and from there down to the estimated ground-based 
emitter location. Such techniques have seen little use in the past because the necessary 
two-dimensional LOB measurements have not been generally available. Once such 
measurements become commonplace, SSL will no doubt receive more attention and 
more thorough assessments. 

A third approach to geolocation that has received little attention in the past involves 
time-difference-of-arrival processing. Multiple HF receivers could relay their received 
signals to a common processing station and pairwise cross-correlate these signals in 
order to estimate time difference of arrival for each pair of sensors. The time dif¬ 
ference of arrival from each pair will determine a hyperboloid in space, and the 
intersection of two such surfaces with the Earths surface will provide the desired emit¬ 
ter location estimate. The extent to which the ionospheric refraction of HF rays will 
compromise such estimates remains a topic for future investigation. 

Figure 5.4 sketches out the concepts associated with each of these geolocation 
methods. 
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Vector Models for Adaptive Array Signals 


6.1 Overview 

Previous chapters have discussed the general nature of the signals associated with HF 
communications, indicating that the signals are typically narrowband but numerous; 
as a result, a desired signal is often simultaneously received with other signals not of 
interest that interfere with reception of the desired signal. In addition to such interfer¬ 
ence from other signals, the desired signal propagates from transmitter to receiver by 
means of ionospheric refraction,- perhaps on several different routes, or modes , each 
with its own delay, frequency shift, and propagation-induced polarization change. 
When these multiple delayed and frequency-shifted copies of the same signal combine 
at the receive antenna, reception becomes garbled, perhaps making it unintelligible. 
This self-interference is a second major issue for HF sensing that can be addressed with 
adaptive array processing. 

The goals for adaptive array processing focus on both forms of interference. First, 
the array and its processor should reduce the effects of undesired signals, suppressing 
them to the greatest extent possible, thereby allowing the receiving system to behave 
as though these interferers were not present. In the case of the self-interfering signals, 
known as multipath , a more ambitious goal might be to combine the several copies 
of the desired signal and to therewith obtain a more accurate estimate of the trans¬ 
mitted signal than if the self-interference from multipath were not present. Although 
both goals will be addressed in this book, the initial topic will be the suppression of 
undesired signals. 

As indicated throughout the previous chapters, the focus of this book is on com¬ 
munication signals. Such signals include many parameters that are not completely 
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known at the receiver, in particular the data to be communicated; if the signals were 
completely known by the receiver, no information would be communicated even if the 
receiver were able to capture a perfect replica of the transmitted signal. Nevertheless, 
although the data is unknown, the general structure of the transmitted waveform is 
usually known by the receiver, and this structure can be exploited as leverage in the 
signal reception process. 

It is important to recognize that communications and radar signal models manifest 
an important difference: the radar signal parameters are much more completely known 
at the receiver, allowing it to implement a so-called matched filter that can compensate 
for the delay and frequency shift experienced by a transmitted signal before it reaches 
the receiver after reflection by the target. In the radar case, there are typically only 
a few unknown signal parameters (e.g., delay, frequency shift, and phase), allowing 
matched filtering to be implemented for all possible values of these unknowns. In the 
communications case, the data represent thousands of unknowns associated with the 
received communications signal, and a receiver cannot be implemented with matched 
filters to all possible data vectors. Thus, a communications receiver must be designed 
from an entirely different viewpoint than a radar receiver. 

The communications receiver interference-suppression structure will be examined 
in detail as the remaining chapters unfold. First, it is appropriate to describe a mathe¬ 
matical model for the signals arriving at an array antenna; this model will be manipu¬ 
lated in subsequent chapters to provide a framework for the interference-suppression 
processing that is the overall goal. 

6.2 Array Signal Model 

The most commonly used signal model for array processing assumes that the signal 
reaches the receiving antenna array as a wave with stationary phase across a plane. 
Such a plane wave can be specified by a Poynting vector corresponding to the signal 
direction at any instant of time. The model also assumes that each element of the 
array sees the same scalar signal, and element response differences associated with a 
particular signal are due to that signals DOA, specified by the signal-specific direction 
vector u*. 1 Such a model will be assumed here. The plane wave model is sufficient to 
characterize the effects of array geometry on the signals received at the various array 
elements. 


1 This direction vector, with its origin in sensor coordinates, is usually the negative of the signal propagation 
vector, the Poynting vector P; any differences are due to coordinate system definitions. 
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The arrays considered here consist of N antenna elements, each with its own sig¬ 
nal conditioning and digitizing circuitry such that each antenna element signal can 
be identified individually; thus, if there are N antenna elements, there are N signals 
available for processing. Each of these N signals is identified with its own receiver 
channel , thus such an adaptive array system employs an N-channel receiver. In 
particular, it is appropriate to identify the complex signal observed at time t, cor¬ 
responding to the output of a baseband demodulator for the n-th element, by the 
subscripted symbol z n {t) and the collection of such signals at the N elements by 
the vector 



for 0 <t < T. 


m(t) 



Each of the {. z n } signals is complex to represent the in-phase and quadrature com¬ 
ponents of a received signal in a given channel. This complex vector notation has 
become standard in the adaptive-processing literature and is utilized throughout this 
work. Boldface, lowercase letters represent complex vectors corresponding to a tem¬ 
poral snapshot of the signals observed at the set of array elements, and boldface, 
uppercase letters represent complex matrices with columns corresponding to a set of 
such snapshots. The columns can assume various meanings, depending on the con¬ 
text. Common usage involves 1) snapshots taken at various times, usually in sequence; 
2) orthogonal polarization components from polarization-sensitive array elements; 
and 3) time-aligned snapshots, each attributable to a different source. 

As discussed in chapter 4, a signal impinging on an antenna is not received perfectly 
but is captured with some imperfections. Some of these can be identified with an addi¬ 
tive noise; others can be associated with element-to-element differences in response as 
a function of signal DOA. It will be assumed throughout this book that the addi¬ 
tive imperfections can be modeled as a single, complex, 2 3 additive noise component 
in each receiver channel. Further, the additive noise will be different in each channel 
and in fact will be assumed uncorrelated from channel to channel. This noise can be 


2 The term channel can be confusing because it can refer to the transmission medium for a particular signal or to 
the path a signal follows from a particular element in an antenna array through the early stages of signal 
processing. Here, this term will be used, we hope exclusively, with the second of these meanings. 

3 Complex noise corresponds to independent noise components that are associated with an in-phase and 
quadrature signal representation. 
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represented by the noise vector n(V), 

n\(t) 

ni{t) 

n it) = . for 0 < t < T. (6.2) 

_ ^ • 

The DOA-dependent differences of the signal are captured in the so-called array 
response vector, v(u / ), also called the steering vector. Thus, the N-channel received 
signal z (t), in response to signals from S sources, can be written as 

z (t) = v(u i)si(t) + n(^) for 0 < t < T. (6.3) 

i=1 

Here, the signals variation with time is given by sft). 

The amplitude and phase response of each array element varies with the direction of 
the incoming signal. Further, the phase difference between any pair of elements varies 
with their spacing and orientation. All these phase and amplitude differences are cap¬ 
tured in a complex vector function of angle of arrival known as the array manifold. 
Most often, the arrival angle is specified with two components, azimuth and eleva¬ 
tion; however, some array-processing scenarios are better characterized with an array 
manifold that is a function of just one of these angles. 

When there are multiple signals in the environment, it is common to define a signal 
modulation matrix S with the i^ row containing samples from the i^ signal, a received 
waveform matrix Z containing element signal samples at the i^ element in the i^ row, 
a noise matrix N containing noise samples at the i^ element in the i th row, and a steer¬ 
ing vector matrix V with the column containing the steering vector associated with 
they‘d signals steering vector. 

With these definitions, the summation in equation (6.3) is provided by the matrix 
product VS so that this equation becomes 

Z = VS + N. (6.4) 

6.3 Ideal Propagation Model 

Although the HF medium induces important propagation effects on signals refracted 
by the ionosphere, these effects are largely observable as multipath. In chapter 4, these 
multipath effects were characterized by ray splitting such that a transmitted signal 
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s(t ), after it traverses the ionosphere, is well modeled by a sum of several delayed and 
frequency-shifted versions of the original signal. Each version is identified as a mode. 
That is, 

s{t) — ^ s(t — x m ) exp(—i2jrf m t). (6.5) 

m 

In this sum, the subscript m indexes over the propagation modes induced by the 
ionospheric conditions. Since each mode is distinguished by a characteristic delay 
and frequency shift, these mode-specific values are represented by subscripts in the 
summation of equation (6.5). 

This multipath model is responsible for an important feature in the received signal 
model at an antenna array, which, after all, is a linear system: the individual delayed 
and frequency-shifted rays can each be modeled separately in their interaction with a 
specific channel of the array, and the resulting total signal in that channel becomes the 
sum of the mode-specific results for that channel. Note that the matrix model given by 
equation (6.4) can incorporate this multiple-mode modeling by including the several 
modes in the matrices V and S. 

It is now appropriate to describe how an individual mode interacts with an array 
element in detail. 

Consider a single plane wave arriving at an antenna array with N elements. It is 
necessary to establish a coordinate system for the array and to characterize the incom¬ 
ing plane wave in this coordinate frame. Once such a coordinate system is specified, 
the locations of the array elements can also be specified; then the characteristics of the 
signals reaching each of the antenna elements can be computed. This computation 
is based on the array electromagnetic model assumed throughout this book; specifi¬ 
cally, a plane wave model with no signal attenuation or other dispersive effects over 
the aperture of the array. Such a model implies that the signals impinging on each of 
the array antenna elements differ only in regard to their phase; there are no amplitude 
differences. 

In other contexts, it might be appropriate to include amplitude differences in the 
propagation model, but such situations do not arise from a single plane wave at HE It is 
only when multimode signals reach an array and coherently combine that element-to- 
element amplitude differences are observed. The propagation discussion in chapter 2 
includes^ some illustrations of these effects. 

TurningTiow to the computation of element-to-element signals, it is appropriate to 
reference the diagram included in figure 6.1. 

First, a coordinate system must be defined. By convention, such a system is a rect¬ 
angular system oriented with the z-axis upward, the y-axis aligned with north, and the 
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Z (up) 



Figure 6.1 Basic coordinate system for HF array antenna element signal model 

x-axis pointing to the east to complete a right-handed frame. These conventions are 
indicated in figure 6.1. 

For any plane wave front impinging on a specific antenna element of the receive 
array, one can specify a Poynting vector that is perpendicular to the plane wave and 
pointing to the origin of the coordinate system [1, 2, 4]. The distance from this wave 
front to the origin, at a reference time when it reaches the antenna element, provides 
a measure of the time delay associated with the arrival of the plane wave at the origin, 
relative to the time when it passes this specific antenna element. In particular, if this 
distance is d and the wave velocity is c , the relative time delay is d/ c. For narrowband 
waveforms, if the wavelength of the plane wave is A,, the time delay translates to a 
relative phase of 2 7id/\ rad. Since signals at all points on the plane wave have the 
same phase, the length of d when the plane wave passes through an antenna element 
determines the phase of the signal at the antenna element relative to the coordinate 
system origin. But d is easily computed as the inner product of a vector from the origin 
to the specific antenna element and a unit vector aligned with the Poynting vector. 

Suppose the plane wave is arriving from an azimuth a (measured clockwise from 
north, the y-axis) and an elevation 6. Then, the unit vector aligned with the Poynting 
vector is given by 


u= —[cose sin a cos € cos a sine]. 


(6.6) 
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A vector from the coordinate system origin to an element at the coordinates (x a ,y a , z a ) 
is just 

Pe — [x a Ja Z a \. (6.7) 

Therefore, if s(t) is a signal observed at the coordinate system origin, the signal 
observed at an antenna element located at position V e is 

s a (t ) = s(t) exp[— 27r/(u • V e )/X\. (6.8) 

6.4 A Simple Example 

The simplest meaningful array example one can examine involves two omnidirectional 
antenna elements a distance D apart [3]. The plane bisecting the line between these two 
elements specifies a plane of symmetry. Because the antenna elements are omnidirec¬ 
tional, there is cylindrical symmetry about the array axis. Thus, the critical geometrical 
factor is the angle between the signal Poynting vector and the array axis, or equivalently, 
the angle 6 between this vector and the plane of symmetry. A signal arriving from any 
direction on the cone defined by the angle 6 will affect this simple two-element array 
in the same way; note that tc /2 — 6 is the half-cone angle associated with this coni¬ 
cal locus of common signal arrival locations. Figure 6.2 shows this simple geometry, 
from which it should be obvious that the wave front in question arrives at Element 2 
a time D sin 9/ (2c) before it arrives at the origin and at Element 1 an identical time 
after the wave front passes the origin. Thus, the time delay observed between a wave 



Figure 6.2 Two-element array: two-dimensional geometry in plane containing array axis 
and signal Poynting vector 
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front arrival at the two antenna elements is D sin 9/(c). For a narrowband waveform, 
a delay of one radio-frequency period corresponds to a phase shift of 2tv rad; that is, 
A (j) = 2nf At , where/' is the carrier frequency of the wave. Therefore, the time delay 
between a wave front arrival at these two elements translates to a phase difference of 
2nf{D/ c) sin 6 = 2n {D/X) sin 6 . 

The simplest possible array processor entails merely adding the signals from the two 
elements together. The resulting array output is thus 

y(t) = s(t){exp[+m(D/X) sin0] + exp[— iit{D/X) sin#]}. (6.9) 

The term in the curly brackets characterizes the directional behavior of this simple 
array. If x/r is defined to be 2jt{D/X) sin 0,y(t) becomes 

y{t) = s(t){exp[+ix/f/2] + exp[— ixfr/2]} = 2s(t) cos(xj//2). (6.10) 

The magnitude squared of this expression, normalized to its value at 6 = 0 (thus, 
x\r — 0) and expressed in decibels, is just 

G{xj/) = 10 log 10 cos 2 \[f/ 2. (6.11) 

Note that this expression is a function of the variable x/s, which is a compound vari¬ 
able depending on both the distance between antenna elements Z), normalized to RF 
wavelength X, and the sine of the arrival angle. It is interesting to explore the antenna 
pattern, which characterizes the response of the two-element array as a function of the 
signal arrival angle, as the spacing between the antenna elements, normalized to wave¬ 
length, varies. Figure 6.3 shows this pattern, which is a plot based on equation (6.11) 
for a specified value of D/X. 

The antenna pattern for D/X = 0.5, half-wavelength spacing, has a single main lobe, 
rapidly decreasing to a null at an azimuth angle of about 80°. On the other hand, for 
larger values of the ratio D/X or for element spacing greater that half wavelength, the 
first null of the pattern for D/X = 0.5 moves from the arrival angle of 7r/2 rad, to a 
smaller angle; thus these patterns can exhibit multiple peaks, all at the same gain value. 
Figures 6.4 and 6.5 illustrate this effect. These peaks are indicative of the ambiguities 
that arise when the antenna elements are too widely spaced relative to wavelength. This 
array design issue will be discussed more fully after the topic of array direction-finding 
accuracy is addressed. 

Although the directional patterns in figures 6.3—6.5 have been normalized to their 
unnormalized value at 6 = 0, it is important to realize that the magnitude of the sum 
is 2 at this angle, and thus the power gain at this angle is 6 dB. This observation is a 
hint of one advantage of using multiple antenna elements in an antenna system. 





Power gain (dB) Power gain (dB) 
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Figure 6.4 Two-element antenna patterns for D/X = 1.0 
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Next, the discussion will turn to a simple direction-finding approach based on this 
same simple array configuration. Consider the patterns associated with a processor that 
subtracts rather than adds the outputs of the two antenna elements. Now the result is 

= s(t){exp[+i\/f/2] — exp[—fi/r/2]} = 2is(t) sin(i/r/2). (6.12) 

The directional pattern in decibels for this processing scheme, normalized to the peak 
for the additive processor, becomes 

10log 10 [sin 2 (^/2)]. (6.13) 


If both the additive and the subtractive processors are implemented and their ratio is 
examined, one has 


2ism 2 (\j//2) 
2 cos 2 (V / '/2) 


/ tan 2 (^/2). 


(6.14) 


If now the sign of the difference is detected by the processor and incorporated into the 
output pattern, the result, in decibels, is 


10 log 10 {tan 2 (f/2) • sign[tan(i^/2)]}, 


(6.15) 
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Arrival angle (deg) 

Figure 6.6 Sum and difference processing output near pattern midpoint 

which exhibits a near-linear variation around the array plane of symmetry, as shown in 
figure 6.6. Such processing provides a simple direction-finding capability, much like 
that implemented in early direction-finding systems explored by Marconi. 

6.5 Beam Formation 

-- 1 , 

The examples in the previous section assumed the two element signals were added 
without element-to-element gain or phase differences. If the combination includes 
such differences, the result is a weighted sum of the two element signals. Extending 
this concept to an array with TV elements implies the formation of a sum of TV complex 
outputs of the TV array elements, producing a single complex output; this summation 
process is known as beam formation. If a single plane wave impinges on the array, the 
output sum will vary as a function of its direction of arrival, suggesting the term beam 
for the result. For a weight vector w, such a sum is represented in matrix notation by 

s(t) = w H z(^). (6.16) 

The variation of this beam with angle of arrival is known as a beam pattern. Using the 
two element array of figure 6.3 as an example, consider applying a phase difference of 
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7r/4 to these elements. This weighting is expressed by the weight vector 


w = 


1 

exp(m/4) 


(6.17) 


In the case of the simple array with D/X = 0.5 illustrated in figure 6.3, application of 
this weight vector to the element signal vector produces the beam pattern shown in 
figure 6.7. 

It is clear that this phase weighting steers the beam off boresight, as expected. Vary¬ 
ing the phase weighting changes the direction to which the beam is steered. As will be 
discussed in detail in succeeding chapters, more complex weighting can be used not 
only to steer the peak of the beam but also to locate deep nulls that can be used to 
suppress interference. 


6.6 Weight Vector Computation 


As described in the previous section, summing the array element outputs, with differ¬ 
ent complex weights applied to each, produces a single complex output, the magnitude 
of which varies with the arriving signal direction of arrival. The remainder of this 



Figure 6.7 Simple two-element array output with phase weighting 
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volume will focus largely on the specification of these weights and on ways to produce 
desirable properties in the resulting beam. As has been stated the goals are to produce 
gain in the direction of a desired signal and nulls, or at least low gain, in the directions 
of interferers. 

A frequent goal is the formation of a beam that maximizes the signal-to-noise ratio at 
the array output [5]. From equation (6.16), if the array element outputs are t — s + n, 
where s represents the signal components at the array elements and n represents the 

noise components, the output signal power can be represented by s= \w H s\ 2 and the 

output noise power by n — \w H n\ 2 . As a result, assuming the weights remain constant, 
the output signal-to-noise ratio becomes 


/ s\ w^ss^Jw w^Rjj w 
w^[nn^]w w^R^w 

which can be written as a Rayleigh quotient as 

/ S \ _ Z H Rm! /2 R ss R~in 2 t 
\«/ Z^Z 


(6.18) 


(6.19) 


where z=R„ 2 \v. Such a quotient is maximized by the maximum eigenvalue of the 

symmetric matrix in the numerator, K JW K SS K WJ — K~ J ? K SS . 

Since the eigenequation for this eigenvalue is given by 

R- j / 2 R ss R-]/ 2 z = Xz (6.20) 

and z = Rmf w, the optimum weights must satisfy 

R“ 7 ; R„w = Aw. (6.21) 

If A. is chosen as the maximum eigenvalue and such an eigenvalue maximizes the signal- 
to-noise in equation (6.19), the weights corresponding to the eigenvector of the largest 
eigenvalue in this equation are those that maximize the signal-to-noise ratio. As shall 
be seen later, this set of weights creates a beam in the direction of the desired signal 
while placing low gain values in the directions of interferers that generate the noise 
covariance R ;/;/ . 
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6.7 Polarization-Sensitive Arrays 
6.7.1 Physical Motivation 

Antenna array elements can exhibit sensitivity to impinging wave polarization. Such a 
sensitivity can be used to advantage in processing HF signals since wave propagation 
effects can cause impinging waves from different sources to differ in their effects on 
the various array elements. As a result, polarization enhances the differentiability of 
incoming signals in many situations. This section introduces the notation needed to 
present array-processing techniques with polarization-sensitive array elements. 

Polarization-based processing [6] extends the definition of an array response vector 
to include characterization of an array’s response to signals as a function of their polar¬ 
ization state. By convention, the polarization state of a plane wave signal is described 
in terms of its horizontal and vertical electric field vector components. The horizontal 
electric field component is perpendicular to the plane of incidence defined by a verti¬ 
cal plane at a given azimuth angle 0; thus, this horizontal electric field component is 
often identified as the 0-component. Recalling that the Poynting vector u lies in this 
incidence plane and that the electric field rotates in a plane orthogonal to it, the verti¬ 
cal electric field component lies in the plane of incidence orthogonal to the Poynting 
vector. Because the zenith angle 6 is measured in this vertical incidence plane, this elec¬ 
tric field component is identified as the 6 -component. Note that although this electric 
field component lies in the vertical plane, it is only vertical (parallel to the £-axis) when 
6 = jt/2. 

A vertically polarized antenna will only sense the vertical component of the wave, 
and a horizontally polarized antenna will only sense the horizontal component. 
Because most HF signals are carried by waves with elliptical polarization; that is, 
with phase-related vertical and horizontal components, an array of vertically polar¬ 
ized antennas, such as vertical monopoles, will only observe the vertical component 
of the arriving wave. Although the elements will be able to capture at least some por¬ 
tion of the wave energy, the energy carried by the polarization component of the wave 
orthogonal to the antenna polarization will be missed. If the array antennas are iden¬ 
tical, they will all capture the same energy fraction (to the extent that the wave front 
does not vary across the array). 

When characterizing the array response vector for polarized waves, the vector must 
include a factor representing its response as a function of the incoming wave polar¬ 
ization. Should this factor be common to all antenna elements, it will be common to 
all components of the array response vector. It can therefore be factored out, and even 
though this factor will affect the fraction of the incoming wave energy captured by 
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the antenna elements, this factor can essentially be ignored in any direction-finding 
array-processing algorithm. 

If all array elements are identical and dual polarized, with separate output ports for 
each of the two orthogonal polarizations, each element will capture both polarization 
components of the incoming wave, making the elements more efficient. If these ele¬ 
ments are all physically oriented in the same direction, their element patterns (response 
as a function of impinging signal direction, u) will be identical and can be factored 
out of the total array response, leaving the element phase center displacements as the 
only contributors to direction-of-arrival determination. 

Now if the array elements are not identical but instead manifest diverse polarization 
characteristics, eachelementwillresponddifferentlytoanincomingwave. Thus, thearray 
response vector components will not only include the element-to-element time delay 
component characterizing identical element arrays but will also include a polarization- 
sensitive factor that is different from one element to the next. This will not factor out of 
the array response vector as it would with identical elements. As a result, the polarization 
sensitivity should be included in the design of an array-processing technique. 

Most land-based HF antenna arrays fielded to date have included only identical 
vertical antenna elements. Shipboard arrays may be outfitted with identical elements, 
but their proximity to structural components of the host ship give them a polarization 
characteristic that is not only sensitive to both polarizations of an arriving wave but also 
different from one element to the next, perhaps in unpredictable ways. Once such an 
array is installed, it is possible to develop a polarization-sensitive calibration through 
measurement or electromagnetic modeling and to incorporate the element-specific 
polarization characteristic in the array steering vector. Because, as suggested earlier, 
polarization-sensitive elements have the potential for performance improvements, it is 
worthwhile to understand those opportunities available if a polarization observation 
capability is implemented in the processing. 

6.7.2 Modeling Polarization-Sensitive Elements 

It is important to remember that for arrays with polarization diversity, the steering 
vector includes dependencies not only on the angle of arrival but also on the signal 
polarization characteristic. As has been assumed throughout this work, radio waves 
have a transverse electromagnetic (TEM) form, with their electric and magnetic fields 
in a plane that is orthogonal to the direction of propagation. One can describe such a 
wave using a coordinate system that utilizes unit vectors in the 0, v, and — r directions 
(the wave is propagating toward the coordinate frame origin) corresponding to the 
usual spherical coordinate system. Since this is a TEM wave, the electric field has no 
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Figure 6.8 Coordinate system associated with polarized signals 


component in the r-direction and can be represented by the vector sum of its 0 and v 
components 

£ — + £ v u v . ( 6 . 22 ) 

Here u 0 and n v are unit vectors in the azimuth and elevation directions, respectively. 

Figure 6.8 illustrates the coordinate directions. If the wave is characterized by an 
amplitude s(t), phase r)(t ), carrier frequency co 0 , and polarization parameters y and 
£, these components are specified at a time t by 

£ v =s(t) sin y exp {j[co 0 t-\- r]{t) + £]}, (6.23) 

and 

£(j> = s(t) cos y exp {j[co 0 t + ri(t)]}. (6.24) 

Such a wave can be represented in a two-dimensional space by a complex vector 

p(x>£)= f U (6.25) 

sin y exp{y[£(?)]} 
cos y 



s(t)exp[j(co 0 t + ?y(f))]. (6.26) 
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Note that in this space, the 0-coordinate is horizontal, and thus the phase reference 
in this representation is the horizontal polarization vector. 

Each antenna in an array can be characterized by its response to linearly polar¬ 
ized waves in the x, y, and £ directions. It is important to recognize that an antenna 
may respond differently to the electric and magnetic components of an impinging 
TEM wave. Therefore, these two components must be considered separately. Thus, 
the arriving wave should be described in a six-dimensional space corresponding to the 
x, y , and z components of the electric and magnetic fields. One can then define an 
antenna response by six complex numbers corresponding to the antenna responses to 
unit electric and magnetic fields polarized in each of the three coordinate directions. 
Then, the specific sensitivities of an antenna to each of these six field components can 
be accommodated by the antenna response model as an inner product of the arriving 
wave components and the corresponding antenna sensitivity components. Note that 
because the array is assumed to have elements with diverse polarization, these responses 
can be different for each antenna element. 

This approach leads to a six-component field component vector with the first three 
components describing the electric field in the three orthogonal directions and the 
remaining three components describing the comparable magnetic field components: 


= ©(0,0)p(y,?j). (6.27) 


4 

Ey 

E z 

H x 

Hy 

Hr. 


Here the product on the right side provides the relationship between the field compo¬ 
nents and the corresponding direction of arrival/polarization characteristic of the arriv¬ 
ing wave. 0(6, 0) is a six-row, two-column matrix indicating the six field components 
for vertical and horizontal polarizations for a wave arriving from the direction (0,0): 


©( 0 , 0 ) 


cos(0) COs(0) 
sin(0) cos(0) 

— sin(0) 

— sin(0) 
cos(0) 

0 


— sin(0) 
cos(0) 

0 

— cos(0) cos(0) 

— sin(0) cos(0) 

sin(0) 


(6.28) 


and p (y, rj) is the polarization vector defined in equation (6.25). 
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Each antenna element is assumed to have different sensitivities to a wave with a 
specific polarization characteristic, making this response different for each antenna 
element. Thus, it is appropriate to index the electric and magnetic field sensitivities 
v ex , Vey, v ez , v mx , v m y, and v mZi each of which is a complex number to represent ampli¬ 
tude and phase, with a superscript, for example, v n ex , to specify the relevant antenna 
element. Further, in modeling the phase differences between elements due to their 
relative location in the array, a phase coefficient of the form exp(y'2jrP^ • u) must be 
included; here V e is a vector representing the geometric coordinates of the n-th antenna 
element, and u is a unit vector in the direction of the impinging wave. 

As a result, the complete model for the antenna element at coordinates V e becomes 
the complex scalar 

T - -i 

Vex 
Vey 

ez exp (j2jt T? e • u) (6.29) 

Vmx 
V m y 

_ Vmz _ 


(6.30) 


Equation (6.30) specifies the response of one antenna element to a single signal. When 
there are multiple signals present, the receiver will add their responses in the usual way. 

Polarized antenna elements enable extensions of the steering vector concept, mak¬ 
ing a steering vector sensitive to the polarization, as well as the direction, of the arriving 
wave. Because the polarization of a wave can be characterized by its vertical and hori¬ 
zontal components, it is appropriate to define two steering vectors to describe an array 
response, v/, characterizing the response to a horizontally polarized wave and \ v char¬ 
acterizing the response to a vertically polarized wave. Such a wave is characterized by 
its angle of arrival 0, as well as by a 2-vector that specifies the relative strengths of its 
two polarization components, k^. Since an arbitrarily polarized arriving wave can be 
described as a linear combination of horizontal and vertical waves, the response of the 


Vex 

V ey 

= [0(0,0)p(y, T])] r Vez exp(y'2;rP f • u). 

Vmx 

Vmy 

V m z _ 
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array to a wave arriving from a direction 6 with polarization state k p can be sped- 

fied by the N x 2 matrix \ p (9) and can be written as a linear combination of and 
\ v , weighted by the polarization state components. Using matrix notation, this array 
response can be written as V^k^,, 

b(@,k p )=V p (6)k p , (6.31) 

where k^, is the two-component vector 4 describing the polarization state of the 

incoming wave, and V p (0) is the N x 2 matrix [v/j, v v \. 

As will be discussed in chapter 9, the basic approach to incorporating polarization 
in an array-processing algorithm extends the response of the array to include polariza¬ 
tion sensitivities and to incorporate these sensitivities in the metrics associated with 
the particular algorithm. Then, because the polarization state of the impinging wave 
is generally unknown, its polarization becomes a nuisance parameter that must be 
optimized over as a step in the appropriate computations. 
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Signal Detection Processing 


7.1 Overview 

Now that the fundamental concepts behind adaptive array processing for HF appli¬ 
cations have been covered, it is time to move into the meat of this work, specific 
algorithms for processing the data collected by a multichannel HF sensor. This chapter 
begins the tour, covering many of the important methods for detection; that is, pro¬ 
cessing the set of array element outputs to declare that a new signal has appeared among 
the myriad of other signals present in the HF signal environment. 

As discussed in chapter 5, much of this text addresses how an antenna array can 
extract information from a specific HF signal under the assumption that the signal is 
known to be present in the environment and that the processor has at least approximate 
information about its time and frequency extent. The initial stage in prosecuting such 
signals must therefore be the detection of a specific signal and in the process learning 
when it starts and ends. This chapter focuses on detection methods used to assess 
the signal environment in order to determine when a signal starts, when it ends, and 
what is its frequency range. Such topics are all considered to be part of the detection 
problem. 

First, in the HF application, it is important to recognize that the arrays of concern 
are small relative to the propagation distances involved. This observation motivates 
the view that a single plane wave reaching an array excites all elements with the same 
amplitude. If there are initially no signals present and one arrives, similar amplitude 
changes will occur at all elements, providing redundancy for use in potential detection 
methods. However, if the array elements are not identical, the responses to this exci¬ 
tation will almost certainly differ, perhaps substantially. Further, if there were other 
signals present before the new signal arrives, the new signal will combine with those 
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already present to generate further differences among element responses. Even if there 
were no signals present prior to the arrival of a new one, local reflections can generate 
coherent multipath that corrupts a simple model consisting of a single plane wave that 
is to be detected in receiver noise. These various factors make the detection of a signal 
in the HF environment challenging and complex. 

As described in the previous chapter, a multichannel receiver senses the environ¬ 
ment with an array of antenna elements* providing a series of complex data samples 
from each. The focus in any detection discussion is choosing, between two possibilities: 
1) the receiver output samples represent the sum of a specific signal together with addi¬ 
tive noise, or 2) these samples represent additive noise alone. Models; for the signal and 
the noise are used to develop a probabilistic basis for such a detection decision. Most 
often, the noise is modeled as white (successive noise samples are statistically indepen¬ 
dent) and Gaussian; for a multichannel receiver, the noise samples; in one channel are 
assumed to be independent of those in another. Such a model is a good match to a 
situation where receiver front-end noise is the only barrier to perfect reception. 

However, at HF, noise is often due to sources other than receiver front-end noise. 
Atmospheric noise; sky noise; and most importantly, other signals; that spectrally (and 
temporally) overlap the signal of interest can dominate the front-end noise. Interfer¬ 
ence from other signals often has spatial’ structure and may actually be a coherent signal 
that is not of interest to the receiver operator: General models for such interference are 
difficult to identify because the. range of possibilities is so large and because the spec¬ 
tral and temporal characteristics of this, interference interact with those of the signal 
of interest in very specific ways. 

The chapter will start with a discussion of some issues peculiar to the HF environ¬ 
ment and to the signals that inhabit it. It will then continue with a review of the basic 
maximum likelihood detection formalism with an emphasis on multichannel receiver 
modeling. In its details, the detection problem involves a generalized likelihood ratio 
test (GLRT) in which maximum likelihood estimates of the unknown signal para¬ 
meters are used for the alternative hypotheses. Because such receivers can sense the 
spatial distribution of the interference, they provide interference-suppression leverage 
that is singularly important at HF. This observation prompts a discussion of detection 
as a decision about a change in the environment. This topic has been covered in the 
statistics literature when identifying a change in an observed environment is the goal; 
some useful results from that literature will be reviewed next. In particular, because 
the HF band is far wider than that of any one HF signal and the signal of interest 
duration is often brief, wideband processing with good temporal resolution is needed. 
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Both temporal- and spectral-processing architectures will be examined to address these 
concerns. This discussion will lead to some specific change-detection algorithms that 
have been useful for HF sensor applications. 

7.2 HF Environmental Detection Issues 

The change-detection approach identified in the previous section will be applied to the 
HF signaling environment in the next two sections. As a prelude, it is appropriate to 
heuristically describe those salient features of the HF environment important enough 
to be considered when detailing these applications. These features can be described 
by two adjectives: congested and variable. Both of these aspects arise directly from the 
long-range propagation phenomena that underly usage of the HF band. A few details 
about each aspect of the HF environment are in order. 

7.2.1 HF Spectral Occupancy 

The HF band, from 2 to 30 MHz, has been widely utilized since the advent of radio 
communications because over-the-horizon propagation is enabled by ionospheric 
refraction. The phenomenology of this process was summarized briefly in chapter 2. 
Here, the aim is to present a brief “demographic” concerning the population of sig¬ 
nals in this band, their strength, their temporal and spectral character, and their 
occurrence. Unfortunately, the occupancy data relevant to this topic are sketchy and 
anecdotal. Predictions based on known transmitters, using their antenna character¬ 
istics and transmitter powers as well as ionospheric propagation prediction software, 
such as IONCAP, are unreliable because there are many unregistered HF users, and 
the patterns of usage by all users is so variable. Some measurements have been made, 
but a detailed description of seasonal, diurnal, and sunspot number variations has only 
been published for a few sites. Projecting such site-specific information to a new site 
without in situ measurements at that site is unreliable at best. In summary, a broad 
view of spectral occupancy is difficult to distill from the scattered reports. 

Most measurements have generally been performed with a spectrum analyzer or 
with a narrowband (1—4 kHz) HF receiver using computer-controlled frequency tun¬ 
ing. Scheduling the receiver is a trade between long dwells with infrequent revisits 
of a particular frequency or rapid-fire short dwells that are inadequate for signal 
characterization. Either way, multiple collection campaigns are necessary to cover the 
longer-term temporal variations. 
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Some measurement projects have been motivated by efforts directed toward inter¬ 
ference mitigation modulation and/or modem designs [12]. Two measurement efforts 
stand out [12,17]. Narrowband measurements by Laycock and Gott employed a 1 kHz 
bandwidth and a 1 s dwell time at a given frequency. A channel was deemed “occupied” 
at a specific signal level if the signal exceeded that level for the entire 1 s dwell. Measure¬ 
ments were conducted near noontime and midnight because propagation conditions 
were most stable at those times. Three-day measurement campaigns were undertaken. 

These spectral occupancy results were subdivided by the ninety-five subband 
allocations of the International Telegraph Union (ITU). The model by Laycott et al. 
expresses occupancy (percent occupied) as an analytic function of critical parameters, 
including frequency, time of day, bandwidth, season, geographic location, and sunspot 
number. This function takes the form 

Q = expy/[l + expy], (7.1) 

where 

y — A-k + BP r + (c 0 + c\fk + c 0 f^)nssN- (7.2) 

Here is indexed on the ninety-five ITU allocations, B is a coefficient of the receiver 
threshold level P r , and is the center frequency of the allocation. c 0 and c\ are 
constants. 

Although sunspot number is included in the model, there is no provision for mod¬ 
eling time of day, season, or geographic location. Nevertheless, this model is widely 
accepted and used for planning purposes. 

The second approach that has received attention, by Perry and Abraham [17], uti¬ 
lized a wideband 1 MHz bandwidth measurement system sampling at a 2.5 Msps rate 
with a 105 ms dwell. Four such dwells were recorded every minute. The resulting mea¬ 
surements were used to construct a spectral occupancy model [18]. Distributions of 
pmini the external noise floor, and p m axi the highest interference level in a narrowband 
cell, were produced from the wideband data using Fourier transform spectral analysis 
techniques. 

This model was used to derive the percentage of cells containing interference at 
a given signal-to-noise ratio. Typical results from measurements in the rural eastern 
United States show 10 percent to 40 percent of frequency cells contain interference, 
depending, of course, on threshold. 

In summary, the limited availability of measured data makes generalizations dif¬ 
ficult. Nevertheless, all measurements to date suggest a crowded HF spectrum; 
mitigation by adaptive array-processing methods will certainly be valuable. 
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7.2.2 HF Propagation Statistics 

A second consideration in the development of detection techniques concerns the vari¬ 
ability and unpredictability of path loss from transmitter to receiver. As described in 
chapter 2, the diurnal variation in path loss at a given frequency is huge. This fact will 
determine whether a given transmitted signal can be received at all at a given receiver. 

Fading is more relevant to the detection problem, however. This often rapid 
variation in signal strength can be due to one or more of these causes: 

• The polarization of the propagating wave rotates. 

• Ionospheric changes with time due to ionospheric interaction with solar 
energy. 

o Layering structure 
o Absorption 

• Ionospheric electron content changes along the propagation path force the maxi¬ 
mum usable frequency below the frequency in use, temporarily causing channel 
dropouts. 

Figure 7.1 provides an example of predicted signal strength from a specific trans¬ 
mitter operating at a particular frequency over a twenty four-hour cycle. 

Figure 7.2 shows a typical scintillation example. Such rapid variations are common¬ 
place at HF. 

7.3 Likelihood Ratio Detection 

V 

Basic GLRTs for detection have been presented by many authors [3, 2], and it is not 
appropriate to cover the topic in great detail here. Nevertheless, a brief outline of 
detection modeling is necessary to provide the notation used in the subsequent GLRT 
detection extensions that are the primary focus of this chapter. 

7.3.1 Probabilistic Data Model 

Recall the receiver model for a sequence of complex samples z obtained by digitizing 
the multichannel front-end output, z is a length -N vector of synchronous snapshots 
across the multiple channels. Assuming that additive, white Gaussian noise is the 
primary source of interference, this vector is again modeled by 

z(t) = Vs(t) + n(V). 


(7.3) 
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Here the number of channels is N, the number of snapshots is Z, and the number 
of signals is N s . V is an TV x N s matrix with columns containing the steering vectors 
associated with the several signals represented by the complex samples at time t, in the 
rows of the A^-vector s(t). 

Under the previous assumptions concerning the vector of noise samples, its 
probability density function (pdf) is given by a complex circular Gaussian density 

p(n) = 7t~ N \ E exp [—tr ( X _1 nn H ) ]. (7.4) 

If £, the covariance matrix of the noise, is assumed to be white, with independent 
channel-to-channel receiver noise, it would be expressed asX=a 2 U, where Iyv is an 
N x N identity matrix. However, at HF the background can be due to a variety of 
sources, and therefore this assumption will not be made in the forgoing. Note that the 
exponent of 7r in equation (7.4) is —N and not —N/ 2, as would be the case if the 
samples were real. Here each complex sample has two degrees of freedom, doubling 
the total degrees of freedom associated with the A^-vector. 

Now, applying the signal model of equation (7.3), one obtains the probability den¬ 
sity function conditional on each of the alternative hypotheses. Under hypothesis 
that the receiver output vector is due only to noise, the conditional pdf for this vector is 

p(z\Ho) = 7 z'- n \ sr ; exp[—ttCX-V 7 )]. (7.5) 

Alternatively, if there are multiple signals present, each with its own steering vector 
modeled in the columns of V, under hypothesis T~L\ that the receiver output vector is 
due to both signals and noise, the conditional pdf for the received vector is 

p{z\Ui) =y- Af exp{-tr[i:- i (z- Vs(f))(z-Vs(f)*)]}. (7.6) 

Normally, when one is designing such a basic hypothesis test, the hypotheses consider 
whether one signal or no signal is present. Thus, the signal vector used in the detector 
design can be a scalar function of time instead of a comparable vector that describes 
several (N s ) signals. 

In the HF signal environment, this goal is particularly relevant because the envi¬ 
ronment is so undisciplined and crowded. Signals begin quite arbitrarily and from 
independent transmitters. As a result, it is rare that two signals will turn on simul¬ 
taneously. Furthermore, most detecting systems are continually monitoring the envi¬ 
ronment and as a result have the opportunity to observe every signal start. As will be 
discussed shortly, the multiple signal issue is most concerned with detecting whether a 
new signal starts when one or more other signals are already under observation. In such 
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a circumstance, the start of a newly arriving signal must be detected in a background 
consisting of one or more ongoing signals in addition to interference from various 
noise sources. Then, the detection model must be modified to include such a back¬ 
ground as the alternative hypothesis, instead of the noise-alone background specified 
in equation (7.4). 

7.3.2 Likelihood Ratio Tests 

Many texts covering binary hypothesis testing describe various ways to utilize the sta¬ 
tistical characterization of an environment for decision making. The Bayes criterion 
and the Ney man-Pears on criterion are two of the most common. Detailed discussions 
of these criteria lead to the formation of a ratio of likelihood functions, resulting in a 
so-called likelihood ratio test (LRT) that computes the ratio A (z) 

A(, )s »(jW 

Pz\'H 0 (z\'U 0 ) 

and compares it to a threshold; this threshold is established differently for the various 
criteria. Then, if the threshold rj is used, the LRT can be summarized as 

A(z) r h (7.8) 

Because the natural logarithm is a monotone function and because Gaussian statistics 
are often encountered in signal-processing problems, it is useful to define an equivalent 
LRT based on taking the natural log of both sides in equation (7.8). 

7.3.3 LRT Example 

To illustrate the LRT with a realistic example, consider the following situation. Assume 
an input data stream consisting of 100 ten-sample segments of a real Gaussian process. 
In each segment, the mean of the Gaussian process is either zero or one. These 1000 
signal samples are presented to an LRT, and the goal of the test is to decide which of 
the two mean values is relevant to the segment. To carry out this test, it is appropriate 
to employ equation (7.6) and its counterpart under the alternative hypothesis. 

In this case, the process is a scalar process so that no covariance matrix is involved. 
Nevertheless, the computations 

In p{z\Hi) = —tr (z — Si)(z — si) H /cr 2 , i=0,l 


(7.7) 


(7.9) 



Signal Detection Processing 159 



Sample number 


Figure 7.3 Comparison of input(z) and output decision (0 or 1) for likelihood ratio test 
example 

can be carried out and their difference compared to a threshold. In this simple example, 
a threshold midway between the two alternative mean values is appropriate, but if there 
is a different a priori distribution for the two alternatives, it can be reflected in the 
threshold setting. 

The results of carrying out this process are presented in figure 7.3. It is clear that 
the LRT performs its task effectively, no doubt the result of a sufficiently high signal- 
to-noise ratio (10 dB). This example could be used to explore the effects of various 
a priori hypotheses and/or costs associated with errors in either direction. Such results 
would lead to so-called receiver operating characteristics, which illustrate the trade¬ 
off between detection and false alarm probabilities. Such computations are left to the 
reader’s discretion. 

7.4 Perturbation Discovery 

The previous section addressed a general approach to detection: for a given set of 
data samples, decide if they arise from an environment consisting of noise alone or 
if they arise from an environment consisting of a signal together with additive noise. 
In this section, the problem is generalized, and a time element is added to the mix: 
for a given set of data samples, the first portion of which arise from one environment 
and the remainder of which arise from a changed environment, decide first when 
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the change occurs and second, estimate parameter vectors associated with the two 
environmental states. 

This problem has been addressed in several contexts [4, 5, 6]; as will be seen, the 
published results are applicable to the present problem: the detection of a new signal in 
an HF signaling environment when the initial environment includes several signals as 
well as receiver noise. The context of this earlier work varies but includes segmentation 
of speech signals into segments, failure detection, tsunami detection, and tracking of 
nonstationary environments with adaptive control loops. 

A series of increasingly complex models serves to illustrate the critical ideas. These 
models fit into a useful framework. Consider the following general model: 

Assume the detector is presented with a series of real samples, z\,Z 2 , .zi repre¬ 
senting an environment characterized by a parameter vector 9 . The first subgroup of 
r samples may follow a model specified by a vector 9 0 , while the remainder may 
follow a model A4i, specified by a vector 9\. The decision process is to decide between 
the null hypothesis, H 0 : all L samples are consistent with model JV[ 0 and the alter¬ 
native whereby the first r samples are consistent with model and the following 
(L — r) samples are consistent with model A4\ . Further, the process should determine 
the sample number r corresponding to the change and the parameter vectors associated 
with the two model states. 

Note that this problem formulation assumes that there is only one transition in 
the set of L samples. This assumption forces a restriction on environments in which 
applications might be useful. But this restriction can be invoked by the appropriate 
selection of L and the allowed alternatives for r. Examples given later in this chapter 
will clarify how this might be done. 

7.4.1 A Simple Model: Change of Gaussian Mean 

With this description, a useful probabilistic model for change detection can be devel¬ 
oped. A simplified example is useful preparation for the more complex situations 
associated with new signal detection in a typical HF environment. This first specific 
model is based on observations of a real Gaussian random variable with varying mean. 

Consider a series of L samples of a white, Gaussian process with variance a 2 and 
mean /x. A maximum likelihood approach is described here. 

Two issues should be addressed in designing this process. First, false alarms, the 
decision that a jump in mean did occur at some point in the record of L samples, 
even though none did, are to be avoided. Second, the decision that a change in mean 
does occur—the detection of an event—should be made as soon thereafter as possible. 
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Clearly, there is a trade-off between these two figures of merit. For the present, the 
optimal solution will be based on a minimum detection delay for a fixed mean time 
between false alarms. > 

Assume that a set of L observations z(\),z(2), :..,z(L) have been made and that 
both fyi 0 and /jl\ are known. Consider two hypotheses: H\, that there is a change in the 
mean starting with sample r, and the alternate hypothesis H 0 , that there is no change 
in mean throughout the observation set. A GLRT is applicable is these circumstances. 
Thus, a likelihood ratio is formed, grouping the observations into a vector z. 

A s = - 

p(AH 0 ) 

YY^pM VLtPM 

Y\k=\po( z k) 

_ rr p\ (zk) 

\ po(zk) 
k—r 

Here, p 0 (z>k) is the probability density function for the samples under hypothesis H 0 , 
and p\(zfc) is the probability density function for the samples under hypothesis H\; 
that is, 

pi(z) = — 1 — exp 

crv^r 


(■Z fti) 

2 a 2 


21 


(7.13) 


( 7 . 10 ) 

(7.11) 

( 7 . 12 ) 


Using a log-likelihood formulation, one obtains 


where 


A L (r) = log A s (r) 


[X\ ix 0 


L 


a 


E(^- 


(pto + im) 


k=l 


1 


= — s,(fi 0 , V), 


(7 


J 


sj(ji, v) = v - -y —)» 

k—i 


and v = ii\ — fi 0 is the size of the jump. 


(7.14) 

(7.15) 

(7.16) 


2 


(7.17) 
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But the jump time r is unknown. Therefore, its maximum likelihood estimate is 
used. This estimate is 


ri — arg max A^ = arg max S^{gi 0 ,v), 
_ . 1 <r<L 1 <r<L 


(7.18) 


leading to the detector formulation 

gL = a L(h) — maxv) ^ X. 

r 


(7.19) 


Here, k is a detection threshold useful in adjusting the false alarm rate. 


7.4.2 Another Simple Model: Change of Gaussian Variance 


Again, there is a series of L real samples from a white Gaussian process with variance 
a 2 and mean /z. In this model, the mean is known to be zero, but the variance starts 
equal to a 2 and starting with some unknown sample, the variance changes to a 2 ; the 
mean remains equal to zero. As in the previous example, this change will be considered 
an “event.” Again, the goal is to determine an “event detection” process. This process 
should determine if and when a change in variance occurs. 

Letting z represent the sequence of L samples, a log-likelihood ratio is formed: 


Az(r) — In (7.20) 

PM) 

=ln ni,(Wr 1/2 expM^M) 2 ] ^ 

nLrC™?)- l/2 exp [-(,Zk/(y 0 ) 2 ] 


where 



( 7 . 22 ) 


(7.23) 


(7.24) 
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Figure 7.4 Example of a Gaussian variance change event detection: a) input data; b) LRT 
statistic; c) decision result 

Then, as with the previous Gaussian mean model, the maximum over the variable r is 
used to determine the most likely sample at which the event, a change in the Gaussian 
variance, occurs. 

Figure 7.4 provides an illustration of this algorithm. 

7.4.3 A Multidimensional Model: Change of Gaussian Covariance 

The previous models were based on observations of a real, scalar variable £. Of more 
interest to the array-processing application would be observations of a complex vector 
process sampled over some time interval, and in particular, one that involves a change 
in the covariance of the observations. Such a model will be described in this section. 

i 

7.4.3.1 Methodology 

Consider a set of 77-dimensional snapshots taken at a sample rate F s over an interval 
/, which is divided into two subintervals I\ and I 2 of duration T\ and T 2 , respectively. 
L\ —F S T\ observations are made in the first subinterval, while L 2 = F S T 2 observations 
are made in the second. It will be assumed that the / — th snapshot consists of a complex 
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Gaussian TV-vector with mean zero and covariance K z e {K, K;}. Under hypothesis 
7i 0 all observations throughout the interval / are from a Gaussian random process with 
covariance K 0 ; under hypothesis 7~Lj all observations in the first subinterval are from a 
Gaussian random process with covariance K^, but those from the second subinterval 
arise from a process with covariance K/. The goal is a decision between these two 
hypotheses. Note that this is a multidimensional version of the scenario analyzed in 
the previous section, with additional complexities associated with the fact that the 
relevant covariance matrices are unknown, and the data are complex. 

If K* and K; are known, an appropriate log-likelihood ratio test could be set up, 
and a straightforward maximum likelihood decision process could be implemented. 
However, it is assumed here that these covariances are not known. Note that under 
either hypothesis the first subinterval is governed by covariance and mean m 0 , and 
thus these parameters can be estimated using the samples in this first subinterval. First, 
the mean is estimated: 


/\ 




(7.25) 


Using this result, the sample covariance matrix is determined: 


A 1 A 

Ko = — Y'fa ~ m„) (z - m 0 ) H . 

L\ ^ 


( 7.2 6 ). 


i =1 


In a similar fashion, one can compute the sample mean m; and covariance K; for the 
second subinterval using its L 2 samples. 

Then, one can ask if K; is close to K 0 , in which case the samples from the second 
subinterval can be assumed to arise from a Gaussian process with covariance K^; if 
K; is not close to K 0 , then presumably these samples are due to a Gaussian process 
(as assumed) with a different covariance K/. This is effectively what the LRT/GLRT 
implements. 

The appropriate likelihood ratio and decision criterion are given by 


A(Z) = 




K;|-^ exp[—Z // K7 / Z] 

K.|-^ expl-Z^'Z] 


(7.27) 


(7.28) 
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Ui 

$ V (7.29) 

H 0 , 

where Z is a matrix with columns consisting of the snapshots taken over the appropriate 
observation interval. 

Taking logarithms of both sides and absorbing terms unrelated to the collected data 
Z into the threshold, the decision rule becomes 

A (Z) = Z h [K; ] - kJ*] H Z (7.30) 

= tr ([k; 1 - K ! ]ZZ h ) (7.31) 

Hi 

^ ri'. (7.32) 

H 0 

This requires that a covariance of the data samples be formed and its components 
weighted by differences in the two sample covariance matrices. Noting that the trace 
of a matrix is equal to the sum of the eigenvalues, one can interpret A (z) as a mea¬ 
sure of the size of the weighted component sum, which can then be compared with a 
threshold. This interpretation makes intuitive sense, at least to the author. 

7.4.3.2 Example 1 

A straightforward example assumes that Af-dimensional vector data from interval I\ 
is based on L\ — F s T\ white Gaussian samples with standard deviation <7 = 1, while 
similar data from I 2 may be based on L 2 = F s T 2 white Gaussian samples with stan¬ 
dard deviation a — 2 or a — 1 . Ideally, the covariance matrices are a scaled identity 
matrix, but this example uses sample data, causing the covariances to deviate from 
this ideal. As suggested, the covariance matrix for each interval is computed in 
accordance with 



(7.33) 
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where the mean for each interval is computed in the usual way: 



(7.34) 


Note that the computation of the sample covariance matrix in equation (7.33) is not a 
maximum-likelihood estimate because the assumption of white Gaussian data samples 
implies that the sample mean, calculated in equation (7.34), is known to be zero. 
Nevertheless, the calculation of a sample mean for each interval is made and utilized 
in this example. 

A simple experiment was conducted with fifty eight-element vectors collected from 
each interval, L\ — L 2 = 50. 

The likelihood ratio defined in equation (7.32), using the collected data, was cal¬ 
culated for 100 cases. In half the cases, the data in both intervals were based on cr = 1; 
in the other cases, the data in the second interval were based on a different standard 
deviation, a — 2. The likelihood that the data from I 2 could be attributed to a — 1 
versus a =2 was calculated for the two sets of fifty cases. The results are displayed 
in figure 7.5 as a histogram. It is clear that when hypothesis Tij was correct; that is, 
the data from the second interval were based on a different standard deviation from 



Figure 7.5 Likelihood ratio histogram for two alternative covariances 
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the first interval, the likelihood ratio value was quite separable from its value for cases 
when the alternate hypothesis l~i 0 , that data for both intervals were based on the same 
standard deviation, was “true.” 

7.4.3.3 Example 2 

A even more interesting example involves a sequence of intervals, each corresponding 
to a detection interval in which the question is asked, “Is the signal environment in 
the current interval the same as the environment in the previous interval?” In this 
example, twenty intervals are simulated. For fifteen of the intervals, noise alone is 
present; in the remaining five intervals, there is noise as well as a signal, the latter 
having a power that is twice the noise power. Again, L eight-element vector samples, 
representative of an array snapshot, are taken in each interval. Again, a likelihood ratio 
is computed, comparing the probability that the data in the current interval arises from 
a distribution with the same covariance as in the previous interval. 

Figure 7.6 shows the results in the form of a temporal plot. The power of the 
signal in the environment, either 5 = 0 or 2 (scaled by 500) for each interval, and 
the value of the likelihood ratio for that interval are shown by the two traces. It is 
clear that this process would be successful in identifying those intervals containing the 



Figure 7.6 Likelihood ratio histogram for two alternative covariances 
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signal. In the next section, an even more specific detection-processing algorithm is 
detailed. 

7.4.4 Signal Acquisition by a Predictive Adaptive Beamformer 

With the background models presented in the previous sections, it will not be a great 
leap to an algorithm that has direct application to the HF environment. Consider a 
fairly general array operating environment with both temporally uncorrelated receiver 
noise and various sources of cochannel interference. Assume that a new signal s(t) 
arrives at the array, leading to the complex baseband data vector z (t) given by 

z(f)=vj(f)+i(f). (7.35) 

Here again, i(^) is a vector signal because the various array elements observe each of 
the interfering signals with steering vectors corresponding to their directions of arrival, 
and the receiver noise samples are different element to element. 

Detection of the signal s(t) must be based on statistical properties of the data vector 
sensed by the antenna array. These properties, which emphasize spatial characteristics 
of the arriving energy, are summarized by the sample covariance matrix for samples in 
the n — th block 

R zz(n) = (z(t)z H (t)Y, (7.36) 

where (•) denotes time averaging. 

Thus, detection processing should involve a comparison between the covariance 
matrices obtained under the two hypotheses: 1) a new signal has reached the array, 
and the signal environment has been modified by it; or 2) the signal environment is 
unchanged. 

Such decisions can be implemented using the fact that the signal component of 
a covariance matrix based on the data received by a narrowband array very likely 
undergoes a change in rank whenever a specific signal appears or disappears in the 
environment. If this signal is characterized by a rank-1 contribution to the covariance 
matrix, then this rank change is equal to unity; should multipath be associated with 
this signal, it is possible that the rank change will be greater. 

The observation interval can be divided into short, nonoverlapping sub-intervals, 
each shorter than the expected intermittent signal duration. Spatial statistics from 
observations made in each segment can be compared with those from the previous 
segment. If the spatial statistics of the two adjacent segments are sufficiently different, 
notably as a result of a rank change, either 1) a signal was present in the second segment 
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but not in the first, a signal arrival, or 2) a signal was present in the first segment but 
not in the second, a signal departure or end. 

Denote the duration of each segment by T. With data blocks indexed by n, define 
a series of sample covariance matrices by 

1 f T 

R zz (n) — — / rinT — x)x H \nT — r) dx . (7.37) 

T Jo 

It will be assumed that T is long enough to assure stability in this statistic and that 
the underlying stochastic process is covariance ergodic; that is, the covariance matrix 
manifests equal ensemble and temporal averages. 

/V /\ 

If the new signal is not present, R zz (n) = R^, the covariance of the interfering sig¬ 
nals alone. Note that the entire environment existing prior to the start of the new 
signal can be included in R#. 

Following the discussion in section 6.6, if the new signal is present in the n — th 
block, but not in the (n — 1) — th block, 

R zz(n) = w H S + Rzz(n — 1) = w^S + R«, (7.38) 

where v respresents the steering vector associated with the new signal. 

Should the new signal arrive midway through a block, the contribution of this 
second term is diminished by the fraction of the block without the new signal. 

If a beamformer weight vector that maximizes the output signal-to-interference- 
plus-noise ratio (SINR) is used, the eigenequation 

~Rzz(n)w(n) = X(n)R zz (n - l)w(n) (7.39) 

can provide the basis for detecting the new signal. This eigenequation can be 
rewritten as 

^zz(ti —l^^zzir^win) = k(n)w(n). (7.40) 

This is a standard, as opposed to a generalized, eigenequation. The matrix that is the 

A v A 

subject of the eigenanalysis in equation (7.40), R zz (n — J)~ 1 R zz (n), is known as the 
change matrix . 

The eigenvalues of equation (7.39) are key. If the new signal is present in block n 
and absent in block (n — 1), this eigenequation becomes 

(w^S + Rz/)w(tz) = X(n)Riiw(n), 


( 7 . 41 ) 
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leading to a maximum eigenvalue equal to 1 + y max - If the new signal is 
either block (n — 1) or block n, the eigenequation becomes 

not present in 

/\ /\ 

Riiw(n) = X(n)Rijw(n) 

(7.42) 

so that the maximum eigenvalue is equal to unity. 

Once the maximum eigenvalue X max for an interval has been determined, signal 
presence for that interval can be assessed by threshold comparison 

^ max ^ V 

(7.43) 

and the row vector of signal samples for the interval estimated by 


s( n ) = e^ z ( n ), 

(7.44) 


where z max is the eigenvector corresponding to the largest eigenvalue, and Z(n) is the 
matrix of array snapshots in this interval. 

If the signal turns on partway through an interval, R zz {n) is determined by the 
fraction of the time the signal is on and off. In particular, define ( n ONT — 

Ton)/T , and e off = {Toff ~ uoffT) / T . Then, if the signal turns on in the 
n ON ~ th subinterval, IW(^cw) ^ R« + ^oN^y^S. Similarly, if the signal turns off in 

the nopp — th subinterval, R Z z( n OFF) ~ R;; + Coafv* S. The results are summarized 
in table 7.1 [11]. 

Note that one can determine transitions to subblock accuracy using eigenvalues 
from adjacent blocks. Also, a comparison of eigenvectors can be used to associate turn 
ons and turn offs of the same signal. 

It is then appropriate to define an algorithm using the preceding methodology with 
an interval-dependent set of eigenvalues and eigenvectors based on the generalized 
eigenequation but with the relevant covariance matrices used for the previous and 
current subintervals: 

\{n)%i(n — l)w(n) = R zz (n)w(n). (7.45) 

Table 7.1 shows the relevant eigenvalues and eigenvectors under the four possible situ¬ 
ations defined by the on-off occurrences for each subinterval. This table assumes that 
n ON+ 1 < noFF • 

Note that the expressions for the maximum eigenvalues in subintervals noN an d 
n ON~\~ 1 can be used to estimate 6o7V> an d those from subintervals noFF an d n OFF~\~ 1 
can be used to estimate eoFF- Also note that the inverse minimum eigenvalue 
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Table 7.1 Eigenvalues for all possible inter-interval On-Off situations 


n 

^ max (ft) 

^min(ft) 

other X 

n ON 

1 ^ON Y max 

1 

1 

n ON + 1 

1 “1“ Ymax 

1 ^ ONYmax 

1 

1 

n OFF 

1 

2 

1 "I” ^ OFF Ymax 
i "F Ymax 

1 

ft(DFF + 1 

1 

i/(1 ^ OFF Ymax) 

1 

other n 

1 

1 

1 


can be calculated from a minor modification of the eigenequation equation (7.39), 
interchanging the roles of n and (n — 1) to obtain 

v(n)Ra(n)w(n) = R zz {n — I)w (n), (7.46) 

where v=A. _1 . This approach leads to the largest eigenvalue in v coordinates, 
leading to X m i n . Note that this is effectively operating the detection process 
backwards . 

A modification of this detection technique can be used to simplify the processing. 
If the subintervals used in the eigenequation are separated by an unused interval, a 
decision can be made cleanly between a subinterval with signal on and one with signal 
off. Such a strategy adds contrast to the processing, thereby increasing the reliability of 
the edge detection methodology. Another improvement can be made by increasing the 

A 

number of subintervals used to calculate R zz . Doing so reduces the uncertainty asso¬ 
ciated with the background covariance used as a reference in this “change detection” 
technique. With the variability in. the HF environment, any approach that reduces 
uncertainty is sure to be beneficial. 

Other modifications can be employed to reduce the computational load. For exam¬ 
ple, recognition that the trace of a matrix equals the sum of its eigenvalues, and that the 
sum of vector components with one large entry and small remaining entries is closely 
approximated by the large component alone, allows the trace of the change matrix to 
be used as a stand-in for eigenanalysis when the detection statistic is computed. Once 
a detection occurs, a full eigenanalysis can be carried out to estimate the new signal 
steering vector. 
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Sample number (rate = 80 Msps) 


Figure 7.7 Sampled RF signal at Element 1: new signal turns on at midpoint 

7.4.5 New Signal Acquisition Example 

To illustrate the effectiveness of this new signal-acquisition technique, consider a 10 ms 
scenario with three RF signals, sampled at 80 Msps. All of them have carriers close 
to the same frequency, 12 MHz, but each carrier is slightly different, has a different 
phase, and arrives from a different azimuth. Two of the signals, arriving from —30° 
and +70°, are interferers, the first with random narrowband amplitude modulation 
with 100 kHz bandwidth 1 and the second with 160 kHz BPSK modulation. The third 
signal, a steady tone arriving from +20°, turns on halfway through the timeline, at 
5 ms. All three signals have the same amplitude with a 20 dB SNR and impinge on an 
eight-element, half-wavelength array. In addition, there is simulated Gaussian receiver 
noise (standard deviation of 0.1 relative to the signals). Figure 7.7 shows the RF signal 
samples at Element 1; it is difficult to observe a change in the environment from this 
single element data record. 

These signals are received at the array and mixed down to baseband using a local 
oscillator that is 100 kHz offset from the 12 MHz carrier frequency. The resulting sig¬ 
nals are down-sampled by 200 so that the processing sample rate is 400 ksps, resulting 


1 Bandwidths used in examples are 0 times larger than is realistic for HF to simplify simulation. 
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Figure 7.8 Sampled baseband signal at Element 1 : new signal turns on at midpoint 



Segment number 


Figure 7.9 Change matrix trace detection statistic 
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Figure 7.10 Largest change matrix eigenvalue detection statistic 


in a total of four hundred baseband samples. Figure 7.8 shows the baseband signal 
samples. 

Then, the eigenvector associated with the largest change matrix eigenvector was 
used to copy the signal. The resulting baseband signal is shown in figure 7.11. The start 
of the new signal and the suppression of the interferers are obvious. 

Finally, this largest change matrix eigenvector was used to compute the beam pat¬ 
tern shown in figure 7.12. As expected, there are nulls in the interferer directions 
(—30° and +70°) and moderate gain in the direction of the new signal at 20° azimuth. 
The suppression of the interferer at —30° azimuth is 19.4 dB; the suppression of the 
interferer at +70° azimuth is 18.8 dB. One must be cautioned that the peaks of this 
beam pattern should not be interpreted as directions of impinging signals; this beam 
is intended to suppress signals deemed to be interferers because they were present in 
reference data of the first block while providing adequate gain on a new signal present 
only in the second block, which is under test for a new signal start. Relevant methods 
for direction finding will be described in chapter 8. 

The technique described in the previous section was implemented using seg¬ 
ments consisting of eight baseband samples. Fifty segments are simulated. A covari¬ 
ance matrix was computed for each segment, and from these a change matrix was 
determined for each interval. A noise-only covariance matrix was assumed for the first 
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Figure 7.11 New signal copy result 



Figure 7.12 Beam shape for new signal copy 
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segment. Then, the trace of the change matrix was computed for each segment as an 
appropriate detection statistic. The result of this computation is plotted in figure 7.9, 
and the detection of the new signal is clearly identified in the 26-th segment. For 
comparison, an eigenanalysis of the change matrix was also computed and the largest 
eigenvalue plotted in figure 7.10. It should be clear that the two detection statistics 
produce the same result: a new signal turn on occurs at Segment 26. The plots for this 
simple example would suggest that the trace of the change matrix approach produces 
fewer false alarms, but a much more detailed statistical assessment with a variety of 
scenarios and array geometries would be necessary to conclude that this holds in all 
situations. 

7.5 Temporally Oriented Detection 

In many HF sensor applications, the sensor s goals are to continuously cover a signifi¬ 
cant portion of the 3—30 MHz HF band, detecting signals that arrive, estimating their 
lines of bearing, and estimating the associated baseband signal complex modulation 
(amplitude and phase). The sensor must accomplish these tasks despite a wide variety 
of cochannel interference. 

Most sensors covering a significant bandwidth recognize that the interference envi¬ 
ronment consists of many narrowband HF signals intended for receivers other than 
the sensor at hand. But the hospitable HF propagation characteristics allow such sig¬ 
nals to reach the sensor from great distances, enlarging the number of transmitting 
sites within the sensor s range. Consequently, the interference population is large. 
Fortunately, these interferers are narrowband, and there are a relatively small num¬ 
ber of them in any narrowband slice of the environment. Thus, the most successful 
approach to interference suppression is to continuously capture the entire HF band 
segment of interest, at a high-enough sample rate to achieve high fidelity, and to 
employ digital signal-processing techniques to segment the captured bandwidth into 
many narrowband slices. If this is accomplished coherently for all antenna array chan¬ 
nels, the interference-suppressing beamformer approaches described throughout this 
book can be applied separately in each of the slices. The number of interferers in any 
one slice will be limited to a few, and an affordable number of antenna channels 
will be adequate to handle them. The consequence is that the sensor must be capa¬ 
ble of processing many narrowband slices simultaneously, leading to a serious digital 
signal-processing load. 

Architectural studies have concluded that this processing load is a linear function 
of the number of slices but a cubic function of the number of antenna elements. Thus, 
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the incentive is to reduce the bandwidth of the slices to a point where the inter¬ 
ference environment provides at most three or four signals in any slice. When HF 
multimode propagation characteristics are considered, such a slice population could 
be handled by an array of about eight antenna elements. Detailed site-specific environ¬ 
mental surveys should be done to confirm that adequate antenna numbers are deployed 
and that the signal population can be handled by the available signal-processing 
resources. 

In each narrowband slice, detection processing should be carried out using the tech¬ 
niques described earlier in this chapter, in particular those detailed in section 7.4. 
Clearly, this is a temporal process in each frequency slice within the sensor s operating- 
band segment. The slice is temporally divided into subintervals, and a detector is 
configured to examine each subinterval as it becomes available to the slice processor. If 
these subintervals are examined in time sequence, a new signal will be reliably detected 
when it occurs. Cochannel interfering signals that are not new will be suppressed by 
the burst acquisition processing described in detail in section 7.4.4. 

The signal-processing architecture must be chosen to first, allow band slicing using 
discrete Fourier transform (DFT) methods and second, to carry out the burst acqui¬ 
sition processing separately in each slice. Since the band slicing is accomplished on 
a block of wideband data, the r.esult is a simultaneously available set of samples in 
each slice. Recall that the band slicing must be accomplished for all antenna chan¬ 
nels to provide the vector of samples needed for multi-channel detection processing. 
Usually, several such “new” vector samples are desired for the burst acquisition pro¬ 
cessing, and as a result appropriate buffering is necessary before a subinterval can be 
properly processed. Then, the next subinterval for all slices produced by the DFT 
process becomes available simultaneously. The burst acquisition processor must be 
designed to run through all such slices in less time than it takes to produce the next 
new subinterval. A formidable challenge indeed! 

Figure 7.13 illustrates the data flow associated with this processing. The antenna 
signal from each element is appropriately conditioned (amplified and filtered) to 
feed individual analog-to-digital converters for each antenna channel. The ADCs 
should have appropriate sample rate and dynamic range specifications as discussed 
earlier; with todays technology, a sample rate in the range 80-100 Msps, with 16 
bits of sampling precision, would be typical. A multistage FFT process is then used 
to produce vectors with frequency-based components for each antenna; care must 
be taken to ensure that antenna channel coherence is maintained and that adequate 
FFT windowing is employed to minimize leakage between frequency cells. The result 
of this FFT process can be viewed as a snapshot matrix, with rows corresponding 
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Figure 7.13 Data flow for multichannel detection, DF and copy 


to the individual antenna channels and columns corresponding to narrowband fre¬ 
quency channels. A sequence of such snapshot matrices is produced to capture the 
long-term output of the array. Often this sequence of matrices is considered to be a 
three-dimensional “data cube.” Its contents serve as input to the detection, direction 
finding (DF), and copy processing, the results of which are considered to be the goals 
of the sensor processing. 

7.6 Spectrally Oriented Detection 

The previous section suggested that the processing to cover several MHz of receiver 
bandwidth should be carried out separately for each one of many narrowband slices. 
The signals in each slice will persist for a number of temporal epochs in each slice, 
but under typical circumstances, many slices will be empty. If one were to examine a 
short temporal epoch over a wide spectral segment, the temporal slices will cut across 
this wide spectral piece, dividing it into spectral epochs analogous to the temporal 
epochs. In effect, the extensive temporal-spectral domain captured by the receiver will 
become divided into processing cells that are very limited in both time and frequency. 
This segmentation of the captured domain is illustrated in figure 7.14. In this simple 
example, no windowing was employed in the FFT process. Three signals are present in 
the environment, a tone in frequency cell 8 that starts halfway through the illustrated 
time span, a BPSK signal at frequency cell 16, and a random amplitude-modulated 
signal in frequency cell 28; the latter two signals are present throughout. 
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Figure 7.14 Division of captured signal domain into processing cells 

Recall that the processing in a specific spectral slice utilized a detection algorithm 
that examined narrowband cells in time sequence, comparing the spatial distribution 
of signal energy at the current time to that occurring a short time before. When 
there was a substantial difference in these energy distributions, either a new signal 
had arrived or an old signal had ended. 

Similarly, if one examines the temporal cells at a given time as a function of fre¬ 
quency, there will be large gaps between narrowband slices that contain signals. This 
means that a frequency scan for a given temporal epoch will exhibit On/Off behavior 
as a function of frequency as the scan passes by the frequency cells associated with 
a particular narrowband signal. Such a scan can be processed much like its tempo¬ 
ral counterpart, producing signal edges in the frequency dimension. Such edges can 
be incorporated into a detection algorithm, thereby enhancing the overall detection 
process with a definitive way of estimating signal frequency extent. 

Armed with both sets of detection statistics, as well as the recognition that both 
produce steering vector estimates for the temporal and spectral edges, a definitive and 
robust overall detection process can be defined. 
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7.7 High-Order Statistics 

Before closing this chapter, the first covering array-processing techniques in detail, it 
is appropriate to address the class of signal-processing methods based on higher-order 
statistics. The term higher order refers to the use of statistics that involve prod¬ 
ucts of more than the two components used in covariance formulation. Third-order 
and fourth-order statistics have been specifically utilized in various array-processing 
approaches. 

First, a brief note on terminology. The reader is already familiar with the covariance, 
a second-order statistic; it is an instance of the more general statistic, a cumulant’, in 
particular, it is a second-order cumulant, computed from products of terms taken two 
at a time. 

K 2 =E[ Xt Xt +k ]. (7.47) 

In similar fashion, there are third-order and fourth-order cumulants, defined, respec¬ 
tively, by 

/C 3 (7.48) 

and 


/C4 E\Xf : XtJ r k^t- i rl‘^t J rTn\ E[XtXf^.fe\ * 

E\xfXfj ir i\ * E\Xf^_j z Xf ;-* E\Xf^.j^Xfj r i \. (7.49) 

Such higher-order approaches are intriguing because interference characterized by 
Gaussian statistics is invisible to a cumulant-based statistic, whereas most signals of 
interest are not Gaussian and are thus observable to such a metric. An analysis by 
Friedlander and Porat [7] applies a cumulant-based approach to the detection process, 
and it will be reviewed as an example in order to acquaint the reader with such higher- 
order statistics. 

A set of L samples based on a real scalar time seriesy/ is postulated to originate from 
either of two situations, or hypotheses. The first of these, H 0 , is that the time series is 
due to Gaussian noise alone; the second, H\, is that the time series is due to the sum 
of a non-Gaussian signal and Gaussian noise. More specifically, it is assumed that the 
processes { u t } and {v t } are zero-mean, i.i.d sequences, and {h x (i)} and {h w (i)} are the 
impulse responses of a moving average (MA) process. { u t } is non-Gaussian, while {v t } 
is Gaussian. The two sequences are independent of each other. 

Given sufficient data, a set of sample statistics are computed in accordance with an 
appropriate cumulant. For a fourth-order cumulant, these components are typically 
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arranged in an s 2 x s 2 matrix, where s is the number of sensors collecting data samples. 
In such a matrix, the indices i and j correspond to the rows, and k and / corre¬ 
spond to the columns. Alternatively, the cumulant components can be arranged in an 
/xl column vector obtained from this matrix using the vec operator, which arranges 
successive matrix rows into a single column. This vector can be used to generate a 
detection statistic based on the original data. 

Friedlander and Porat invoke an asymptotic result that concludes the sample cumu- 
lants have a normal distribution, in which case the detection problem can be reposed 
to decide between the two hypotheses 

(7.50) 

H \: s ~ A/"(si, Si), (7.51) 

where £ is the appropriate covariance matrix, and s is the appropriate vector of 
cumulants, under the two hypotheses. 

With this machinery, the optimal detector for deciding between two Gaussian 
hypotheses is well known, leading to a noncentral form of Gaussian random vari¬ 
ables with both the noncentrality parameter and covariance matrix depending on the 
hypothesis H 0 or H\. Numerical approaches can be used to compute the probabili¬ 
ties of detection and false alarm, thereby generating a receiver operating characteristic 
(ROC). For MA processes, the requisite sums are finite and can be carried out using 
appropriate computer code. 

In [7], an example is presented in which the MA process is a simple scaling of the 
non-Gaussian random series under hypothesis H\, and the noise is equal to the Gaus¬ 
sian random series for both hypotheses. For this data, two detectors are considered, 
one based on the sample variance and the other based on the third-order cumulant, 
£ 3 ( 0 , 0). The result is a pair of ROCs for these two detectors. A comparison of the 
two shows that the second-order detector, well known to be a maximum-likelihood 
detector, is 4 dB better than the third-order detector. 

Now that the reader has some, though limited, familiarity with higher-order statis¬ 
tical methods, it is appropriate to discuss their applicability in the HF array-processing 
environment. Key to this discussion is the nature of the statistical processes occurring 
there. It has already been noted that the cumulants defined earlier do not respond to 
Gaussian processes. In situations where the signal arriving at the sensors in an array 
has experienced a direct line-of-sight path from transmitter to receiver, the statistics 
governing the transmitted signal are common with those characterizing the received 
signal. At FIF, however, this is not the case. Propagation through the ionosphere has 
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a significant effect on the signal statistics, as discussed in chapter 2. In particular, the 
multiple instances of microrefraction occurring along the ray path tend to shift the 
statistical character of the signal away from its transmitted characteristics toward those 
of a process constructed, from a large number of individual components. It is well 
known that in the limit, such a process manifests Gaussian statistics. As a result, it is 
invisible to a cumulant-based statistical characterization. This is a second reason for 
higher-order statistical methods to be of limited interest here. 

Later chapters in this work focus on the direction-finding and copy processes, and it 
might be appropriate to consider higher-order statistical approaches to array processing 
for them. In fact, there have been several interesting journal articles that consider the 
application of cumulants to direction finding [9] and copy (signal parameter estima¬ 
tion) [10]. However, as just discussed, the use of higher-order statistics is of uncertain 
value in the HF environment. To be consistent with this viewpoint, consideration of 
such higher-order statistical methods will be limited to this brief section, which has 
focused on the detection process. 
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Array Evaluation: Direction Finding Performance Bounds 


8.1 Overview 

As a prelude to describing specific methods for estimating the direction of an arriv¬ 
ing HF signal, it is appropriate to consider performance expectations, particularly 
with respect to DF accuracy and its relationship to the sensor antenna array. Such 
expectations are most frequently quantified using Cramer-Rao bounds. Such bounds 
are described and derived in section 8.3. The Cramer-Rao bounds described here are 
based on array limitations imposed by receiver noise, and as a result, they may pro¬ 
vide optimistic performance predictions in situations where receiver noise is not the 
only performance-limiting consideration. More complex bounds, including, for exam¬ 
ple, array errors [1], can be formulated; the only one considered in this work is the 
Weiss-Weinstein bound. 

The Weiss-Weinstein bound is one alternative to the Cramer-Rao bound; it is ana¬ 
lytic in nature, and it includes effects due to the sensor antenna and the sensitivity 
of its steering vector to errors. When the signal of interest manifests a high signal- 
to-noise ratio, such effects are negligible, and the Weiss-Weinstein bound agrees with 
its Cramer-Rao counterpart. However, when the signal-to-noise ratio is low, noisy 
data can induce significant changes in the antenna response to the wave associated 
with an arriving signal, leading to large errors in the estimated steering vector and 
consequently, large errors in the line of bearing estimate that are out of line with the 
Cramer-Rao bound. The Weiss-Weinstein bound predicts that when the signal-to- 
noise ratio drops below a specific level, there is a sudden change in the DF error. This 
error can be identified with the possibility of an array sidelobe being mistaken for the 
true signal direction. Weiss-Weinstein bounds will be described in section 8.5. 
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8.2 Value of Bounds 

Earlier chapters have dealt with the HF sensor environment, followed by the previous 
chapter, which discusses the detection of specific signals in that environment; the stage 
is thus set for the balance of this monograph, the detailed products from an HF sensor. 
All these products can be viewed as parameter estimates, and as a result, parameter 
estimation methods provide an important foundation for processing the data collected 
by an HF sensor. 

It is important to separate two situations: 1) a is an unknown random parameter 
with an a priori probability distribution, and 2) a is an unknown but definite quantity. 
The estimation rules are different for each of these. 

The remainder of this chapter will be devoted to a presentation of various bounds 
that quantify how well a parameter, or set of parameters, can be estimated. Such 
bounds are useful because they provide a benchmark for the estimation process, delin¬ 
eating a set of goals for the estimator. If an estimation process is to be carried out, 
these bounds suggest how well they can be expected to perform. Perhaps the bound 
will guide the designer in determining how many measurements are required to achieve 
a specified goal, or perhaps it will indicate that the available measurements are insuffi¬ 
cient for achieving it. Thus, these bounds should be regarded as a tool for the sensor 
planner to use in the sensor design process. 

Before these bounds are presented in detail, it is appropriate to review the overall 
estimation process [2]. 

8.2.1 Single Random Parameter Estimation 

The most basic class of parameter estimation problems involves an unknown single, 
real parameter; refer to it as a. This parameter lives in a parameter space, which for this 
simple class is the real line, —oo < a < +oo. Observations are made with a sensor, but 
because these observations may riot be direct observations of a and may be corrupted 
by noise, one can say there is a probabilistic mapping from the parameter space to an 
observation space. If a set of measurements are taken, these can be represented as a 
point in the multidimensional observation space, which can be described by a vector 
z, which may be dependent on the parameter a , or perhaps by a set of parameters 
described by another vector a. Using this measurement vector, the aforementioned 
probabilistic mapping can be specified by p z \ a {z\a). Finally, the intended estimate is 
obtained using an estimation rule; this rule will be identified by the expression a( z). 
Figure 8.1 illustrates this concept pictorially. 
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Figure 8.1 Model for estimation process 


In the case where a is a random parameter, there might be a cost associated with 
the estimation error a^(z) = E[a(z) — a], where the symbol E designates expectation 
over the distribution of a. Often this cost emphasizes large errors with a quadratic 
cost function; sometimes the cost depends on the absolute value of the error, \ag\. 
Generally, it is appropriate to assign a cost that reflects the users happiness with a 
result, although often this is too subjective to assign a quantitative cost function. In 
any case, however costs are assigned, the objective is to minimize its expected value in 
a way that leads to a tractable problem with a reasonable solution. 

Assume that the a priori distribution is known and corresponds to a density function 
pa(a)> and the cost function can be expressed as a function of the difference between 
the estimate a e ( z) and the true value a. Then, one can write an expression for the 
expected cost, or risk : 

/ oo poo 

da I C[a e (z)]p a>z (a,z)dz. (8.1) 

-oo J —oo 

In this expression, z) is the joint probability of the random parameter a and the 

observation vector z, obtainable from the previously defined quantities, 

pa,x(.rt> z ) —px\a^^^)p a^A) • 

The so-called Bayes estimate minimizes the risk defined in equation (8.1). 


( 8 . 2 ) 
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Depending on the cost function, various Bayes estimate forms can be derived. If the 
cost function is quadratic, one can see from equation (8.2) that the risk corresponds 
to the mean square error. Minimizing this risk can be accomplished by differentiation 
with respect to a( z) and setting the result equal to zero. Noting that 

pa,z( a > z ) = Pa\z(d\z)p z (z), (8.3) 

one can show that the mean square estimate is 

/ oo 

ap a \ z (a\z)da. (8.4) 

-oo 

In these expressions, p a \z(^\^) ls known as the a posteriori or conditional density. Thus, 
the mean square estimate is the mean of the a posteriori density, or alternatively, the 
conditional mean . 

If the cost function is based on the absolute value of the error, the Bayes estimate 
takes a different form. Again by differentiating with respect to a(z) and setting the 
result equal to zero, one finds that the estimate is the median of the a posteriori 
density. 

Finally, one can consider a uniform cost function. For a small region of no- 
cost about the correct value for the parameter A and a uniform cost elsewhere, it 
is straightforward to show that the risk is minimized for the value of the param¬ 
eter A at which the a posteriori density is a maximum. This so-called maximum 
a posteriori estimate is frequently used and is given the special notation a ma p (z) • It 
is often convenient to utilize the logarithm of the a posteriori density when find¬ 
ing the desired maximum; this is justified by the monotonicity of the logarithmic 
function. 

8.2.2 Single Fixed Parameter Estimation 

Before considering specific methods for estimating a fixed but known parameter, it 
is appropriate to discuss how such an estimate might be evaluated. One measure 
of quality is the expectation of the estimate, where the uncertainty focuses on the 
measurement statistics. Thus, this expectation can be specified by 

/ oo 

a{x)p z \ a { 2 \a)dz. (8.5) 

-oo 
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In the event that E[a(z)\=a, for all values of a , one says that the estimate is 
unbiased ; that is, the average value of all estimates made with different instantiations of 
measurement noise equals the quantity being estimated. If this expected value is always 
different from the unknown parameter by a fixed amount, say E[a( z) — a] = b, one 
says that the estimate is biased , and if this bias b is fixed and known, an unlikely sit¬ 
uation, then it can be subtracted from the estimate to provide an unbiased estimate. 
More likely, there might be an unknown bias that is a function of the true parame¬ 
ter value. In such situations, the bias cannot be removed, and the average of multiple 
estimates is of limited value. 

Another quality measure is the variance of the estimation error, 

Var [a(z) - a] = E[*(z) - af - ^(a), (8.6) 

providing a measure of the spread in a sequence of estimates. Generally, one prefers an 
unbiased estimate with a small variance. 

An important approach to obtaining such an estimate is known as maximum 
likelihood estimation , which satisfies 

d \n(p z \ a (z\g)) 
da 

This equation seeks the unknown parameter value most likely to have caused the avail¬ 
able measurements to occur. In the simple situation in which the unknown parameter 
is observed directly but with additive, mean zero noise as interference, the most prob¬ 
able value of the noise is zero (its mean) so that one can estimate the unknown as the 
measurement itself. Since the noise is generally random, its effect can be reduced by 
averaging a number of measurements. 

This procedure can be generalized for more complex situations. The likelihoodfunc¬ 
tion p z \ a (z\o) describes the probability density for a measurement vector conditioned 
on a particular value for the unknown parameter a. Often it is more convenient to deal 
with the logarithm of this function, and since the logarithm is a monotone-increasing 
function, maximizing the logarithm of the likelihood function is equivalent to max¬ 
imizing the likelihood function itself. Finding the maximum likelihood estimate for 
the unknown parameter can then be determined by differentiating ln(p z \ a (z\a)) with 
respect to a, setting the result to zero, and solving for the unknown a. The equation 
resulting from the differentiating process is known as the likelihood equation. One 
can show that mathematically the maximum likelihood estimate is equivalent to a 
maximum a posteriori estimate in the limiting situation of no a priori information. 
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8.2.3 Multiple Random Parameter Estimation 

In many important situations, the goal is to estimate several parameters. In an HF 
sensor, one might want to estimate the azimuth and elevation of an arriving signal as 
well as its carrier frequency and amplitude. The estimation techniques described earlier 
for a single unknown parameter can be extended to cover such goals. 

The source model defined previously is extended by considering the unknown to 
be a vector a instead of a scalar quantity a. If there are K unknown parameters, a 
becomes a K-dimensional vector in a K-dimensional space. All other aspects described 
apply directly, including the two particular cases: a random parameter vector and a 
fixed but unknown vector. 

As with a single parameter estimate, one can define a cost function that depends 
on both the true values for the parameter set, or vector, and their estimates, which 
of course depend on the set of observations z. It will be convenient to focus on cost 
functions that depend only on the error, specified by 

a\ (z) — a\ 

^2 (z) — ^2 

2^r(z) — ajc 

Often the cost function is represented by the sum of the errors squared so that 

K 

C( a,(z)) = ^2 a l ( z ) = a f ( Z K( Z )- (8.9) 

i=i 

From these definitions one obtains the risk for the multiple parameter estimation 
problem, 

/ +oo p 

/ C(a e (z))^ Z) a(z, a)d%d<L. (8.10) 

-CO J 

If this integral is minimized with respect to the unknown parameter components, 
because the cost function is a sum of positive terms, each involving just one unknown, 
each term can be minimized separately, leading to 

/ +oo 

dip a | z I z ) dd (8.11) 

-oo 
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for each unknown, or expressed in vector notation. 


&ms ( z ) 


-L 


+oo 




( 8 . 12 ) 


If one considers a linear combination of several unknown parameters, b = Da repre¬ 
sented by the matrix of coefficients D, one can easily show that the least mean square 
estimate of the linear combination is the same linear combination of the estimates, 
b ms (z) = Da ms (z). This relationship will prove useful in subsequent chapters. 

As discussed in the previous section, maximum a posteriori estimation requires find¬ 
ing the value of the parameter that maximizes the a posteriori distribution, which in 
the case of multiple parameters is given by /> a |z( a l z )- Using the approach taken for a 
single parameter, one can take the derivative of the logarithm of this probability den¬ 
sity function with respect to each of the parameters being estimated and set the result 
equal to zero. With K parameters, there are then K simultaneous equations of the 
form 

dpa\Mz) = 0j i = (8.13) 

a,=a map (z) 


(8.13) 


A partial derivative operator for row vectors can be introduced to simplify the 


notation: 


V„ = 


(8.14) 


Ud 

This notation allows the equation specifying the maximum a posteriori solution to be 
given as 


V a [ln^a|z(a|z)]| a= 


a —&map ( z ) 


= 0. 


(8.15) 


In the multiple parameter estimation problem, it is useful to introduce a measure of 
spread analogous to the estimate variance in the single parameter case; the covariance 
matrix is appropriate for this purpose. If the parameter errors are jointly Gaussian, a 
perhaps optimistic assumption, their joint density is given by 


pzMc) — (l 27r l^ /2 ) lex p(~2 a f A ' 


( 8 . 16 ) 
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Equal height contours are specified by the exponent, 

afA; 1 a e = C 2 , (8.17) 

which is the equation for an ellipsoid in K dimensions. These are termed concentra¬ 
tion ellipsoids and specify the degree to which the errors are contained within this 
volume. 


8.3 Cramer-Rao Bound 
8.3.1 Single Parameter Case 


In the simple situation where there is only one real variable A to be estimated, it 
is interesting to consider a lower bound on the variance of any estimate that can 
be made of A using the observations included in the measurement vector z. Such a 
bound is stated in the so-called Cramer-Rao bound [2], summarized by the following 
theorem: 


Theorem 2. Denoting a( z) as any unbiased estimate of a, then 


Var[^(z) - a] > \E 


d \n.p z \ a {z\d) 
3 a 


~i2 


-l 


(8.18) 


or ; equivalently ; 


Var[iz(z) - a] > 



9 2 ln/ z | g Qs|^) 

da 2 



(8.19) 


In these expressions, the relevant partial derivatives are assumed to exist and are abso¬ 
lutely integrable. The proof of these bounding inequalities is straightforward; the 
reader is referred to the reference. 

These relationships lead to several important observations: 

• The basic form of the bound shows that the variance of an unbiased estimate is 
always greater than a specific value. 

• The unbiased estimate that meets this lower bound is the maximum likelihood 
estimate and is called an efficient estimate; it is found as a unique solution to the 
likelihood equation, equation ( 8 . 7 ), a m i( z). 
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If an unbiased estimate does not exist; that is, there is no way to put the likelihood 
equation into the form 

9 ln[/> 2k (zk)] I N 


[a(z) — a\k(a ), 


( 8 . 20 ) 


a ^/((z)) 


where k is a constant, possibly a function of the parameter <2, then there is no 
way to know how close the variance of any estimate will be to the bound. 

• Because the bound requires that the estimate be unbiased, verification of this 
characteristic is critical. 


It is useful to study the behavior of an estimate as the number of observations increases 
without limit. For example, it has been shown [2, 3] that the solution of the likelihood 
equation converges to the correct value of the unknown parameter. Such an estimate 
is termed consistent. 

It is also worthwhile to note that when an efficient estimator does not exist, there 
may be an unbiased estimate with a lower variance. Because there is no general rule 
for finding such an estimate and because the specific approaches that have been found 
tend to be complex, most attention is given to the maximum likelihood approach. 


8.3.2 Linear Array Examples 

As an illustration of this bound, it is appropriate here to consider the potential per¬ 
formance of a specific antenna array for estimating the direction of an emitter; that 
is, performing direction finding, or, as commonly denoted, DF. As discussed in the 
previous chapter, when there is one signal impinging on the array, the vector of signals 
received by the array elements is specified by 

z = as(a) + ^. (8.21) 

In this example, the array of elements is a straight line, and the direction of arrival a is 
specified by a = cos(0), where 0 is the geometric angle between the signal direction 
and the array axis. It is also assumed here that the signal amplitude a is known. In more 
realistic geometries, the array elements might be distributed in a planar geometry, in 
which case a must be considered as a two-vector, and the evaluation of the Cramer- 
Rao bound must consider the errors in both the azimuth and elevation directions. Also, 
the signal amplitude might be unknown, and a parameter to be estimated, increasing 
the dimension of the unknown parameter vector to three or four, depending on the 
assumption about signal phase. If there are K signals, each would have two direction 



194 Chapter 8 


parameters and two amplitude parameters, leading to 4K parameters. But the present 
focus is on a simple example in which there is only one unknown parameter, the signal 
direction defined in sine-space, and the sensor obtains L data samples. 

With the simplifications associated with the present example, the bound on the vari¬ 
ance inverse expressed in equation (8.19), defined in section 8.3.3 as the Fisher infor¬ 
mation matrix when multiple parameters are involved, can be computed as follows: 


J = E 




Since the only unknown parameter is a = u — cos(0), 


( 8 . 22 ) 


\np{z\u)—\n tc L \y\ 1 exp{ — [z — s{u)] H y l [x — s(u)]} 


Taking the required derivative leads to 

3 \np 


da 


[\np(z\u)] 


= 2Re| 


SO 


that 


J = 2Re 


r 3s^ 

3 u 


' '[z-sO)]. 

(8.23) 

d/ 1 , ) 

y [z s («)] \ 

(8.24) 

i 3s 

y — • 

3 u 

(8.25) 


But for the linear array addressed in this example, if a is the signal voltage and \{u x ) 
is the array response vector for an impinging signal with a direction cosine %, 


so that 


s = a \{u x ) 

~ 1 

exp i 2 jvu x X 2 /h 


= a 


exp i2rt u x xjs[ / X 



(8.26) 

(8.27) 


(8.28) 
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Here u x = cos 0, and □ denotes the so-called Hadamardproduct in which components 
of two matrices are multiplied element by element. Using this derivative in equa¬ 
tion (8.22) and noting that the noise covariance matrix T is diagonal with elements 
equal to a 2 , the bound on inverse variance becomes 



(8.29) 


where X is the wavelength of the impinging signal carrier. Thus, for a linear array, for 
which there is only a single unknown signal direction, the bound becomes a scalar and 
equal to the product of the signal-to-noise ratio (2a 2 /cr 2 ) and a quantity that can be 
seen as the RMS aperture of the array. Thus, utilizing equation (8.18), the bound on 
the variance becomes the inverse of this quantity; that is, = J~ l . 

When applying this relationship, it is important to note that the array centroid 
should be used as the origin of the coordinate system. Otherwise, the excessively large 
value of the sum in equation (8.29) will compromise the closeness of the resulting 
Cramer-Rao bound. 

This expression can be used to produce a plot showing the limiting performance 
of a linear array as a function of the array aperture and the signal-to-noise ratio of 
the impinging signal, as shown in figure 8.2. In this plot, the array aperture varies 
with the number of elements in the array because the spacing here is assumed to be 
half wavelength. As will be discussed in the next chapter, it is well known that the 
array elements must be no closer than one-half wavelength if angle estimates are to be 
unambiguous. The curves given in this figure will be useful later when an example of 
the Weiss-Weinstein bound is considered. 

To make the example more interesting, an alternative array geometry considers a so- 
called thinned array, in which two of the elements are spaced at a half wavelength, but 
the remaining elements are spaced farther apart. If a geometric spacing plan is used, 
with a growth factor ofg = 1.3, the total length of a five-element array becomes 3.662A, 
rather than 2X, 1.831 times the aperture of a half-wavelength array with the same num¬ 
ber of elements. Many such spacing schemes have been explored and reported in the 
literature [5]. Figure 8.3 provides a comparison of the expected performance of this 
specific five-element thinned array with a five-element half-wavelength-spaced array. 

This plot illustrates several important points. First, the lower bound decreases as 
the element signal-to-noise ratio increases; this is because the noise-induced errors 
are smaller relative to the signal, allowing the estimate of the signal steering vec¬ 
tor to be more accurately estimated. Second, the thinned array provides a larger 
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Element SNR (dB) 

Figure 8.2 Line-of-bearing Cramer-Rao bound for linear array geometry 



Figure 8.3 Comparison of Cramer-Rao bounds for half-wavelength spacing and thinned 
array geometries, five-element arrays 
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aperture, leading to a better estimate of the signal direction. Third, as the signal-to- 
noise ratio decreases, although the errors grow due to the increased noise level, the 
advantage of the larger aperture attibuted to thinning decreases. These conclusions 
will be supported with data as the results from example direction-finding techniques 
are discussed. 


8.3.3 Multiple Parameter Case 


A similar Cramer-Rao bound can be obtained for situations involving multiple param¬ 
eters. First, it is necessary to generalize the Fisher information matrix. As with the 
single parameter case, partial derivatives form the basis of the bound. Since there is 
a vector of measurements and a vector of unknown parameters to be estimated, it is 
appropriate to consider how the measurements are influenced by the unknown param¬ 
eters. Because such influences may depend jointly on more than one parameter, these 
influences are quantified by the product of two partial derivatives, each with respect 
to one of the unknowns. More specifically, consider the K x K matrix J defined by the 
set of elements of the form 


Jij — E 


9 ln/> z | a (z]a) 91n/? z | a (z|a) 


d d Uj 


r s2 


= -E 


9 lnp z | a (z|a) 


9 CL^ 9 CLj 


(8.30) 

(8.31) 


Then, these elements can be used to form the Fisher information matrix J as an array 
of these elements. 

Equivalently, one can use the gradient operator to specify the Fisher information 
matrix directly, 

J = ^[V iZ [ln^| a (z|a)]V a [ln A | a (z|a)] r ] (8.32) 

= -^[V,(V.[ln/ z)a (z|a)] r )]. (8.33) 

Using this machinery, one can state the Cramer-Rao bound 



== Var[aj(z) — ai\ >J U , 


(8.34) 


where J n is the ii-th element of the inverse of the K x K matrix J. 

When a set of unknown parameters, specified by the vector a, is to be estimated, 
measurement noise and the extent to which the parameters are observable in the 
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measurements become the primary factors in determining estimation error. The Fisher 
information matrix quantifies these relationships. 

8.4 Minimum Mean Square Error Bound 


It will prove useful later to be aware of a bound that applies to the mean-square error 
associated with an estimate of a random parameter a, a( z) [2]. The randomness of 
a implies that expectations should be taken over its value as well as over the noise 
contributions to the measurements. 

For such a random variable a , and z, a vector of observations used to estimate a , 
the mean square error satisfies 


E\a{ z) -a] 2 > E 


9 ln(/> z , a (z, a)) 
3 a 


n 2' 


-l 


r 


-E 


9 l n(p z , a (z, a)) 
3 a 2 


(8.35) 


Note that the probability density is joint over the random variables a and z and that 
the expectation is therefore over these two variables as well. Of course, the partial 
derivatives must exist and also be absolutely integrable with respect to both variables. 

It is useful to note that the least mean square estimate and the maximum a poste¬ 
riori estimates agree when the bound is met. This can be shown as follows. The first 
expressions in 8.35 can also be written as 

(8.36) 

In this expression, there is equality if and only if, for all z and a , 


E[a( z) - a\ 


> 


E 


9 2 ln/>z|aOk) 
da 2 


-E 


3 2 In p a (a) 
da 2 


31n/>z,^(z, ^) 

da 


— k[a{ z — a)]. 


(8.37) 


Differentiating with respect to a a second time provides the condition 


3 2 \np x , a (z,a) 


da 2 


= —k. 


(8.38) 


or, in terms of the a posteriori density, 


9 2 ln/> z | a (z|#) 

da 2 


= —k. 


(8.39) 



Array Evaluation: Direction Finding Performance Bounds 199 


If this expression is integrated twice and rewritten without the logarithmic function, 
the result is 

pz\a(z\a) = exp(— ka + Cja+ C£). (8.40) 

This result is important in that it allows the conclusion that the a posteriori probability 
density p L \ a (j\d) must be Gaussian for all z in order for an efficient estimate to exist. 
From this logic, it follows that the least mean square estimate a ms {z) and the maximum 
a posteriori estimate a ma p(x ;) must agree whenever an efficient estimator; that is, one 
that satisfies the bound, exists. 


8.5 Weiss-Weinstein Bound 


8.5.1 Description of the Bound 


Whereas the Cramer-Rao bound provides a useful benchmark for many array geome¬ 
tries, when the element signal-to-noise ratio is low, array geometries that manifest high 
sidelobes in their array response pattern exhibit much greater error than the Cramer- 
Rao bound predicts. The Weiss-Weinstein bound (WWB) provides a mechanism for 
predicting this behavior. As with the Cramer-Rao bound, the WWB is stated in terms 
of a lower bound on the covariance matrix that characterizes estimation errors. This 
bound states that 


EQa - u) (u - u) T > Hr - 1 H T = St. 


(8.41) 


Here u is the K x 1 vector estimate, u is the K x 1 true parameter vector, H is a K x T 
matrix with columns h* that represent perturbations of the true parameter vector u, 
and r is a matrix to be defined next. T is expressed in terms of a likelihood ratio £ of 
the observations, given the true parameters, for these parameters being u + h and u, 
respectively: 


£(z;u +h, u) = 


/>(z,u + h) 
p(z,u) 


(8.42) 


This likelihood function is only defined on the set of values for u such that />(z,u) > 0. 
To determine the matrix T, consider a real random vector w with the i — th element 
equal to - 


a £ 7 / 2 (z; u + hj, u) — £ 7 / 2 (z; u — hj, u) 
£[£ 7 / 2 (z;u + h z ,u)] 


(8.43) 


The matrix T is then just the mean square error matrix of the random vector w, 
r =f[ww T ]. r will be assumed here to be positive definite. 
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T is the number of perturbation vectors. This number and the size of the pertur¬ 
bation associated with each of them are completely arbitrary. Every choice produces a 
valid lower bound on the mean square error matrix. One can show that adding another 
perturbation vector can only increase the bound [4]. 

Computation of the bound is fairly complicated, but it can be simplified by the 
definition of two functions 

A L (h„ hj) = [(MP + 7)detR]- 1 , (8.44) 

where 

R = I- \ M p, 1 {v(u + h;)'/ 7 (u + h/) + v(u + (u + h,) }, (8.45) 

and 

5(h z -, hj) = f p l / 2 (u + h l )p l / 2 (u + hj)du. (8.46) 

Ju 

Also, define their product 


C(h„ hj) — ^z(h,-, h hj). 


Then, the elements of the matrix T are given by 



C(hi, hj) + C(-h/, -hj) - C(hi, -hj) - C(-h„ hj) 

C(ki,0)C(hj,0) 


(8.47) 


(8.48) 


When applying these methods to obtain the WWB, one must take care in identify¬ 
ing the domain of integration for the integral in equation (8.46). If, for example, 
the problem involves a line array, with a one-dimensional characterization of the 
direction of arrival, this domain will involve linear intervals on the portion of the 
real line defined by [—1, +1]. If the problem involves a planar array, the region of 
support involves the intersection of three unit disks. Details are provided in the 
reference [4]. 


8.5.2 Linear Array Example 

As with the Cramer-Rao bound, the example of a linear array will be used to illustrate 
the WWB. For the thinned array example of section 8.3.2, the results are shown in 
figure 8.4b. Two observations can be made. First, the WWB is asymptotic to the 
Cramer-Rao bound as the element signal-to-noise ratio increases. Second, when the 
element signal-to-noise (SNR) ratio is close to unity, the WWB starts to diverge from 
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the Cramer-Rao bound and becomes significantly different when the element SNR 
is —2dB. As indicated earlier, this divergence is indicative of the estimation process 
sensitivity at low signal-to-noise ratios. When the SNR ratio is low, the estimate of the 
complex steering vector is noisy, and as a result, a comparison of this noisy vector with 
the noise-free array manifold can result in a misfit at an incorrect direction of arrival. 
When the array is sparse, the manifold includes a number of incorrect directions of 
arrival where it is similar to that for the correct direction. As a result, the noisy steering 
vector estimate can be a better fit to one of these incorrect directions. 

Comparisons of the WWB results for thinned and fully populated arrays, seen 
by comparing the two plots in figure 8.4, confirm that the high array sidelobes 
associated with the thinned arrays are responsible for these sensitivities and con¬ 
sequently, the large jump in the WWB in this regime. At operating frequencies 
where all element spacings are a half wavelength or less, there are no interelement 
ambiguities, but there are still array sidelobes. Thus, the WWB computation can 
still manifest an increase over the Cramer-Rao bound, albeit showing a smaller 
increase than with a thinned array and an initial discrepancy at a slightly lower 
signal-to-noise ratio. Comparisons are complicated because the thinned array aperture 
is larger for a given number of antenna elements than for a half-wavelength-spaced 
array. 

The referenced report [4] covers both these bounds in detail and provides compar¬ 
isons of both for the common HF antenna arrays described in the next section. The 
reader is encouraged to consult the reference and perhaps compute both bounds for 
specific arrays of interest. 

A, 

8.6 HF Array Geometry 

The arrangement of the antenna elements in an area represents a significant chal¬ 
lenge, primarily because the wavelengths at HF are so large. Conventional wisdom 
in the array-processing community has focused on arrays in which the elements are 
spaced a half-wavelength apart. At the low end of the HF band, 2 MHz, this cor¬ 
responds to an element spacing of 150 m, making a uniform linear array of, say, 
eight elements, more than 150(8 — 1) = 1.05 km in length. Even a circular array with 
eight elements would be 380 m in diameter. Clearly, such arrays are inconvenient 
to deploy and construct. A variety of approaches have been used to mitigate the 
HF array size issue over the years. The adaptive array techniques described through¬ 
out this book have brought a new perspective, and it is appropriate to discuss 
it here. 



Lower bound on sin (angle) error Lower bound on sin (angle) error 


202 


Chapter 8 



(a) 



(b) 


Figure 8.4 Comparison of Weiss-Weinstein and Cramer-Rao bounds, five-element array: 
(a) filled, (b) thinned 
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8.6.1 Array Size and Element Distribution 

Basically, the schema for choosing the number and distribution of antenna elements 
in an array is a trade-off among several factors: 

• Line-of-bearing estimation accuracy increases linearly with aperture diameter. 

• Available real estate constrains array dimensions. 

• Ambiguous line-of-bearing measurements arise when elements are spaced further 
than a half wavelength apart. 

• Frequency coverage requirements suggest element spacing based on the highest 
frequency (shortest wavelength). 

• Two-dimensional arrays are needed when signals can arrive from a wide angular 
sector. 

• The number of array elements controls the degrees of freedom in array signal¬ 
processing and hence, the number of interferers that can be suppressed. 

• Costs associated with sampling the array element signals, including amplification 
and filtering, increase linearly with the number of array elements; array signal¬ 
processing load increases as the cube of this number. 

A number of studies that consider these design issues have been published, and the 
reader is referred to the references for the details [4, 5]. One approach that has been fre¬ 
quently used to address the wide frequency extent of the HF band is to segment it into 
two subbands and to design separate arrays for each subband with an element place¬ 
ment that minimizes ambiguities for frequencies in the band associated with its array. 
When goniometers (commutating beamformers) were the only array signal-processing 
technique in use, these arrays were circular, with equally spaced elements. Ambiguity 
control at the upper frequency edge of the band dictated a half-wavelength spacing, 
and a trade among available real estate, line-of-bearing accuracy requirements, and 
cost determined the diameter of the array and hence, the number of array elements. 

For example, an array for the upper subband might utilize element spacing deter¬ 
mined by the 30 MHz frequency, with a half-wavelength element spacing of 5 m. 
An array with sixteen elements would then have an 80 m circumference and about 
a 25 m diameter. If the subband segmentation is based on geometric similitude, the 
frequency dividing the bands would be about 7.7 MHz, wavelength 39 m. For the 
lower band segment, a sixteen-element array with half-wavelength element spacing 
would correspond to a diameter of 100 m, about a football-field-sized area. 

With the advent of adaptive array techniques and the potential for more flexible 
signal-processing techniques than utilized with fixed beamformers or goniometers, 
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geometric array element placement constraints were softened. Designers were will¬ 
ing to employ arrays with variable element spacing. Such arrays would utilize one pair 
of elements with spacing chosen to avoid ambiguity at the highest frequency in use 
with the array but then to increase the spacing among elements as more were added 
to the array. 

One example of this approach employs exponential spacing; another employs log¬ 
arithmic spacing. When wide sector coverage is a requirement, these spacings can be 
accommodated with either a pair of crossed arrays or a spiral array with increasing 
element spacing for elements further from the origin. These options will be discussed 
in sections 8.7.2 and 8.7.3. 

8.6.2 Quantitative Array Assessment—Degrees of freedom 

Once the physical aperture for an array has been decided, that aperture and the 
required range of operating frequencies can be used to focus on the number of useful 
antenna elements. With a constrained aperture, the number of array elements cannot 
be usefully increased without limit; at a point, the additional signals from the added 
elements become correlated and result in little additional information from the array 
processing. This section describes a mathematical model for assessing such limits on 
the element count in an array [6]. 

This technique utilizes a composite incoming wave front model and an array signal 
correlation matrix based on this composite signal impinging on the array. Eigenanalysis 
of this correlation matrix demonstrates that only a limited number of array elements 
support the collection of independent signals; more elements produce more signal 
channel outputs, of course, but these additional signals are highly correlated with each 
other and can be represented by a smaller number of “eigensignals.” Thus, there is little 
point in adding these additional elements to the array. 

The array-processing concepts described in chapter 6 are important to the devel¬ 
opment of this array assessment approach. The concept of a “steering vector 5 was 
introduced as a vector with components equal to the relative phase delay of each array 
element. Because this phase delay is a function of the element location and the arriv¬ 
ing signal direction, the steering vector becomes a function of these quantities. In 
particular, if the N element locations are specified by an N x 3 matrix V e , with rows 
containing the x—y — z coordinates of the n — th element location, and a direction 
vector u corresponding to an azimuth angle (j) and elevation angle € is given by 

u = [cos 0 cos 6 sin 0 cos c sine], 


(8.49) 
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then the complex phasor representing the phase delay at the n — th element is 
specified by 

v„((j),e) = exp[—z'2jrp”u :r ], (8.50) 

where p” is the row vector corresponding to the n — th row of P^. The vector u(0, e) 
is defined as the 1x3 vector for a signal arriving from [<p,e] in accordance with 
equation (8.49); it is the same for the entire array of N elements. 

This analysis postulates a set of K incoming signals, each with its own modulat¬ 
ing waveform s^(t) from its own direction of arrival and therefore identified by the 
subscript k , to indicate this signal-specific direction. With the usual signal model, 
the received vector of outputs from the array elements, when these signals arrive, is 
given by 

z (t) = Vs(t) + n(t). (8.51) 

Here the additive receiver noise sample vector n(^), the signal sample vector s(t ), and 
the N x K matrix V, with columns consisting of the steering vectors corresponding 
to the K signal directions, have been introduced. 

If it is assumed that the signal modulations are all uncorrelated, the noise is white 
(independent samples in a given channel), and the channels are uncorrelated, one 
can form a correlation (covariance) matrix R from these N time sequences, which, 
as described in chapter 6, is modeled using the signal direction vectors, their relative 
powers, and the known receiver noise characteristic. 

R = E[z{t)z(t) H ]=VSV H + G 1 !. (8.52) 

In this expression, the K x K diagonal (because the signals are uncorrelated) matrix 
S contains the expectations of the magnitude squared of the signal waveforms 
E[\s k (t)\ 2 ]; that is, the powers of the individual signals, now denoted by pj,. Note 
that the signal component of R is the N x N square matrix S given by 

K 

s = VSV H = Pkn yX . (8.53) 

k=\ 

It can be spectrally decomposed to provide a set of eigenvalues {A/,} with their 
corresponding eigenvectors {e/,} 


K 


S ^ ^ • 


( 8 . 54 ) 
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The eigenvectors {e^} span a space of dimension N, but the space spanned by the K 
signal vectors {v^} is a if-dimensional subspace of it, known as the signal subspace. This 
space is identical to the space spanned by the K eigenvectors of S having nonzero eigen¬ 
values. The remaining (JSf — K) eigenvectors span the so-called noise subspace. Because 
all eigenvectors of a Hermitian matrix are mutually orthogonal, the signal vectors {v^} 
are orthogonal to it. This fact will be exploited later in the MUSIC algorithm. 

Since the steering vectors and the eigenvectors of S live in the same space, the 
steering vectors can be represented as a linear sum of these eigenvectors. 

N 

Vk ^ ^ O^kn^Ti' (8*55) 

n —1 

Here, the { a^ n } are the projections of the k — th steering vector onto the n — th 
eigenvector. But, the definition of the eigenvalues of S provides that 

K 

Uz ^77 Se„ ^ ^ Pk\^k?i\ • (8.56) 

k=i 

That is, each eigenvalue is equal to the sum of all the projected powers of all steering 
vectors onto that eigenvalue’s eigenvector. If an eigenvalue is small, that dimension of 
the measurement space corresponds to one in which there is almost no power from 
any wave front. Thus, these dimensions are not useful in providing information about 
the signal environment. 

Based on this concept, one can define a set of wave fronts, compute the corre¬ 
sponding correlation matrix, and analyze the roll-off of its eigenvectors. This set of 
wave fronts should be chosen in a way that will suggest, for the situation in which the 
array will be used, the maximum number of useful antenna elements that should be 
used. 

The process of generating this “testing wave front” involves defining a coverage 
region, an aperture, and a frequency range, as discussed earlier. Then a set of K steering 
vectors, uniformly distributed over the coverage region, is defined. Each is assigned the 
same power level. Then, referring to equation (8.53), 

K K 

S = Pk^k^l ~ p Xi > ( 8 - 57 ) 
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or, in an element-by-element specification, with common power levels for all 
wavefronts 

K 

Smn -P/h y ’”k Y t„- (8-58) 

k=l 

For the most general case, assuming that the impinging wave vectors are uniformly 
distributed over an elevation angle range [0, U] and an azimuth range \—W, +W], 
the elements of the S matrix become 

i i r u r w 

Smn = W77 / / exp{-z27r/A[p^-u(0,c)-p^-u(0,c)]}^0^c. (8.59) 

IW U J o J-w 

The geometry of the array is specified in the V e matrix. 

Shnidman [6] has evaluated several array geometries, including the linear uniform 
array, a linear log-periodic-spaced array, a linear random array, and a two-dimensional 
circular array. The uniformly spaced linear array is presented as an example in the next 
section. 

8.6.3 Uniform Linear Array Example 

Consider a three-wavelength linear aperture as an example. Half-wave spacing would 
dictate six antenna elements be used, but the example quoted from [6] considers nine 
and twelve elements, with the goal of understanding whether additional, more closely 
spaced elements, might provide capabilities to separate more signals. An azimuthal 
sector of ±90° is assumed and used to generate the testing wave front. 

For both nine and twelve elements, the S matrices are computed and eigen ana¬ 
lyzed. Figure 8.5 shows the results, which indicate that there is a sharp roll-off in the 
eigenevalue starting with the seventh, regardless of the number of elements. This obser¬ 
vation suggests that once an array includes enough elements to make the spacing as 
close as one half wavelength, additional elements cannot provide a capability of han¬ 
dling more than a number of that is only slightly greater than the number of elements 
available with half-wavelength spacing. 

8.6.4 Array Geometry Assessment—Pattern Comparison 

The previous sections addressed the question of the number of elements that should be 
employed in an array to extract the requisite information from the impinging signals 
without encumbering the processor with excessive elements that are redundant and 
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Figure 8.5 Eigenvalue rolloff for nine- and twelve-antenna elements, aperture = 3k 

hence of little value. An alternative approach to evaluating array geometries focuses on 
the placement of the elements, once an appropriate number has been determined. 

Although the array is to utilize its elements to collect information about particular 
signals of interest while suppressing interference from other signals that may or may 
not be signals of interest, the method addressed in this section concentrates on making 
unambiguous direction of arrival measurements. This implies that the array should not 
manifest sidelobes , strong responses from alternate directions when a beam is generated 
to focus on one particular signal in the environment. 

The approach to be described here is conceptually simple. The designer desires to 
operate the array over a wide band of operating frequencies and over a large sector of 
angular space. He/she is also interested in maximizing the aperture used to produce 
line-of-bearing estimates. Thus, the goal is to generate a set of beams that are as nar¬ 
row as possible, cover the required sector, and possess the lowest possible responses 
in directions other than the steered direction. The degrees of freedom are limited to 
the x- and y-coordinates of the element locations. (Three-dimensional arrays are not 
considered because they are generally impractical at HF.) 

Assume a given N x 2 element placement matrix P e , containing x and y element 
coordinates in its rows. If one assumes, for a given frequency, that one can form a beam 
in a specified direction and then compute its response to signals from other directions, 
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this beam pattern can be examined to determine its beamwidth and its maximum side- 
lobe. If one then forms a number of beams in this way, for the specified frequency, to 
cover the entire desired coverage sector, one can average the two performance measures 
in various ways to obtain a figure of merit for the original element placement. 

Often the design requirements specify an appropriate weighting of beamwidth 
and sidelobe performance (w in the figure of merit computation in equation (8.60)). 
Assume that this can be done, leading to a single score for this element placement. Such 
a procedure can be repeated for another frequency in the range of interest, and another, 

and another_ The results of these frequency-driven trials can again be averaged, 

producing a broadband score for the specified geometry. 

Once this first array placement is fully evaluated, the evaluation process can be 
repeated with another array geometry that can be compared with the first. Many such 
geometries can thereby be evaluated and compared. The process is computationally 
costly, but it is a one-time cost. Once an effective geometry is chosen, an array with 
this geometry will be built, and no further changes are likely to be made. 

This approach can be specified mathematically: 


arg max 

P p 



S (P e ,f, *) 
T(P„/,*) J 


(8.60) 


where is shorthand for a two-dimensional direction vector combining the azimuth 
and elevation parameters 0 and 6. S is the maximum sidelobe for the element place¬ 
ment V e as a function of frequency and signal direction vector, and T is the array 
beamwidth as a function of these parameters. 


8.6.5 Pattern Computation Example 


This example will illustrate the pattern computation approach to array design con¬ 
ceptually without the exhaustive computations needed for a complete demonstration. 
Consider a linear array of four antenna elements arrayed along the x-axis. Table 8.1 
shows the x-coordinate in meters of the antenna elements. At 10 MHz, the midpoint 
of the frequency range of interest, the wavelength is 30 m. Examination of this table 
indicates that the first configuration is relatively conservative in its stretching of the 
element spacing, while the third configuration is quite aggressive. 

It is assumed in this example that the only SOI direction of interest is at 20° azimuth 
but that frequencies between 9 and 11 MHz are to be used. Figure 8.6 shows the 
antenna patterns for the three array configurations at the lower edge of the frequency 
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Table 8.1 X-coordinates (meters) of the four antennas in the three example configurations 


Configuration 

X\ 

X2 

x 3 

X4 

1 

0 

13 

30 

52.5 

2 


15 

45 

75 

3 

0 

13 

35 

120 



Pattern of Confia. 3 at 9 MHz 



Angle (deg) 


Figure 8.6 Antenna patterns for three-element configurations at 9 MHz 

range (9 MHz). It is clear that Configuration 3 has a narrower beam width but that the 
sidelobe characteristics are much poorer than those of Configurations 1 and 2. 

If nine cases, specified by the three frequencies of interest and the three antenna 
element configurations, are identified, one can conceptually examine the two tables 
of beamwidth and highest sidelobe (3x3 entries each), apply appropriate weighting 
to these two factors of concern, and select a configuration that provides an acceptable 
compromise between beamwidth and highest sidelobe level using equation (8.60). 
For this particular example, the two tables are shown below. Table 8.2 shows the 
beamwidth (in degrees) for each of the three configurations at each of the three fre¬ 
quencies; table 8.3 shows the level of the highest sidelobe (linear scale), which should 
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Table 8.2 Beamwidth results for the nine cases examined 


Frequency 

Beamwidth (deg) 

Config 1 

Config 2 

'v 

Config 3 

9 MHz 

22.9183 

16.0428 

11.4592 

10 

20.6265 

13.7510 

11.4592 

11 

18.3346 

13.7510 

9.1673 

Table 8.3 

Sidelobe levels for the nine cases examined 



Sidelobe level (dB) 



Frequency 

Config 1 

Config 2 

Config 3 

9 MHz 

-2.03 

-2.07 

-3.60 

10 

-1.99 

-1.79 

-5.38 

11 

-1.85 

-2.19 

-4.84 


be compared with the mainlobe gain of four. If beamwidth is most important when 
weighing the two design factors, Configuration 3 would be chosen for implementa¬ 
tion. On the other hand, if the sidelobe levels associated with the larger and more 
thinly arranged antenna elements are unsatisfactory, Configuration 1 should be cho¬ 
sen. Note that in this example, the sidelobe levels for Configuration 2 are not much 
better than those of Configuration 3, making Configuration 2 a poor compromise. 

8.7 Some Common HF Antenna Examples 

In this chapter, several techniques for evaluating arrays have been described and 
applied to the linear array geometry. However, such arrays have an important lim¬ 
itation arising from their one-dimensional configuration. An array could be more 
effective in achieving its interference suppression and direction-finding goals if it man¬ 
ifested a two-dimensional geometry. In such a case, the array manifold would become 
a two-dimensional structure, allowing three-dimensional beams to be formed; such 
beams would have capabilities to focus the array output from a conical rather than a 
sector-shaped area, thereby improving interference-suppression performance. In addi¬ 
tion, the two-dimensional array would produce a two-dimensional line of bearing, 
leading to improved geolocation accuracy at the least and perhaps, as will be discussed 
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in chapter 10, an opportunity for a single-site geolocation capability. This section 
describes several of the more common two-dimensional array geometries and discusses 
both their potential advantages and the issues associated with their implementation. 

8.7.1 Circular Arrays 

Circular arrays are the most common form of a two-dimensional array. These arrays are 
constructed from a set of identical antenna elements that are either omnidirectional 
or broad-beamed with the beam pointing outward. They are designed to possess a 
very important feature: performance that is essentially independent of the direction of 
arrival associated with the signal of interest. However, almost all circular arrays in use 
today employ a commutating switch, known as a goniometer , that selects a segment of 
the circle to be active at any instant. Typically, the commutating action picks off the 
signals from antenna elements in the selected segment and uses a passive beamformer 
to combine these signals to create a narrow beam with a radial focal direction. As 
the commutator action continues, the segment of selected antenna elements changes 
and so does the resulting beam direction. The result is a rotating, relatively high-gain 
beam. Rotation rates are typically several times per second, allowing sampled angular 
coverage over a complete azimuthal circle. Figure 8.7 illustrates the concept associated 
with the goniometer approach to a rotating narrow beam. 

Because any direction is sampled, rather than continuous, such a rotating beam 
would be neither successful at capturing a communication signal nor appropriate for 



Figure 8.7 Schematic of an eight-channel goniometer-based circular array beamformer 
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detecting a signal from an arbitrary direction. But such arrays have been employed suc¬ 
cessfully for highly accurate direction finding. The associated functions of detection 
and “copy” are carried out by auxiliary antenna structures: a single omnidirec¬ 
tional antenna element for detection and either this same element, or a nonrotating 
commutator, for copy. 

When circular arrays are designed for use with goniometers, the array diameter and 
the number of elements are generally chosen using the conventional approach of half¬ 
wavelength spacing between elements. The design wavelength is based on the shortest 
wavelength within the planned frequency coverage of the array. If the array is to cover 
the entire HF band, from 2 to 30 MHz, the shortest wavelength, 10 m would suggest 
a 5 m antenna spacing. Should the circular array be planned with fifty elements, the 
250 m circumference, about 80 m in diameter, would provide a beamwidth (assuming 
the commutator selects eight elements) of about 15°, not an unreasonable beamwidth. 
However, if the same array were also to be used at 2 MHz, the beamwidth would 
be fifteen times greater (based on the frequency ratio), almost no beam at all. To 
circumvent this issue, such arrays are designed to operate over a smaller frequency 
range, and two circular arrays are deployed, one inside the other. With two arrays, 
the band segment divide is chosen by a rule of thumb based on the square root of the 
total frequency range associated with the sensor design. Thus, an upper band segment 
would cover the frequency range 8 to 30 MHz. 

In this more modern era, a circular array could be designed for use with adaptive 
array processing. Active receivers would, of course, be required for each active antenna 
element. But the direction-finding approach used with such an array permits the use of 
a thinned array, as discussed earlier in section 8.6. Applying the concept of a thinned 
array to a circular array is straightforward; a substantial number of the array elements 
are dropped. But in order to avoid angle-of-arrival ambiguities due to antenna grating 
lobes, the elements chosen should not have regular spacing. At this point in time, the 
most successful approach to choosing the locations of the antenna elements involves a 
trial-and-error process in which a potential set of antenna element locations is compu¬ 
tationally evaluated for grating lobes over the entire field of regard given in the sensor 
specification. 

8.7.2 Crosses and L-Shaped Arrays 

It should be clear from the discussion in the previous section that the selection of a 
suitable array geometry is a complex trade-off among 1) aperture size, which drives 
line-of-bearing accuracy and is strongly affected by available real estate; 2) frequency 
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coverage, which is often specified by operational requirements; and 3) the number 
of antenna elements, which drives processing costs (adaptive array-processing load is 
often assumed to increase as the cube of the number of antenna elements). Circular 
arrays have little flexibility in such a trade-off. 

On the other hand, a pair of linear arrays oriented at a substantial angle to each 
other in either an X or L configuration can be implemented with unequal element 
spacing as described earlier in this chapter. The spread of elements in two dimen¬ 
sions provides the desired two-dimensional line-of-bearing estimation capability and 
also reduces the extent to which DF accuracy is direction-dependent. However, 
the beam shape, and thus DF accuracy, is still dependent on direction of arrival. 
Because the DF capability is proportional to aperture of the array projected on the 
wave front plane, it is straightforward to estimate the DF accuracy achievable with 
such an array. Figure 8.8 summarizes the results. It should be clear that an equi- 
leg X or L array produces line-of-bearing estimates that vary by VT2) with signal 
azimuth. 

This accuracy variation is apt to be acceptable for almost all operational scenarios. 



Figure 8.8 Line-of-bearing accuracy vs. azimuth for L-shaped array 
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8.7.3 Spiral Arrays 

In an attempt to satisfy all the objectives outlined in section 8.7.1, a spiral array concept 
has been considered. Figure 8.9 illustrates this approach. Such an array has a geomet¬ 
ric aperture that varies slowly with azimuth, leading to almost azimuth-independent 
DF accuracy. Further, if the spiral is populated with array elements that are spaced 
exponentially as one progresses along the spiral format, a frequency-dependent sub¬ 
set of these elements can be selected for array processing. When the frequencies of 
interest are low, elements far out on the spiral can be used; conversely, when higher 
frequencies are to be processed, more central antenna elements can be selected. This 
approach leads to a frequency-independent DF accuracy, a highly desirable sensor 
characteristic. 

Such an approach does lead to a requirement for antenna elements that cover a 
substantial frequency range. Some of the elements near the central portion of the spiral 
might be required to perform well over the full operating of the sensor. 

* 


* 


* 


* 


* 


+ ' * * 


* 


* 


* 


Figure 8.9 Spiral array geometry 
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8.7.4 Random Arrays 

I 

Arrays with randomly placed elements have been investigated for a variety of applica¬ 
tions. Randomness is desirable in that the associated irregularity reduces the likelihood 
of grating lobe formation. However, a truly random array has a nonzero probability of 
manifesting an undesirably high grating lobe over some segment of the array coverage 
sector. To avoid such a problem, it is appropriate to carefully evaluate the grating lobe 
performance of a set of randomly located antenna elements. This should not be an issue 
because the cost of this evaluation is a one-time cost that provides quality assurance 
for the array performance. 
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Direction Finding Techniques for HF Applications 


9.1 Overview 

The bounds described in the previous chapter provide measures of the direction¬ 
finding performance that can be expected from an array with a specified geometry and 
size, at the frequency associated with a particular signal. This chapter covers many of 
the important methods for direction finding; that is, estimating line of bearing (LOB). 
Recall from chapter 7 that the arrays of concern are small relative to the propagation 
distances involved. This observation motivates the view that a single plane wave reach¬ 
ing an array excites all elements with the same amplitude, and if the array elements are 
not identical, the responses to this excitation will differ. If only one signal is present, 
all array elements are identical, and element-to-element mutual coupling is modest, 
phase provides the only clue to signal direction; if the array elements differ in their 
gain response as a function of arrival angle, then relative gain may also provide clues 
to the signal direction. One should also note that most receivers perturb the element 
signals with additive noise, usually with zero mean and Gaussian statistics. This obser¬ 
vation informs the analysis presented here, providing a tractable model for estimating 
an LOB. Finally, it is worth noting that this topic has been the focus of many theo¬ 
retical and experimental research efforts, leading to the rich discourse of the current 
chapter. 

9.2 Interferometry: Phase Fitting 

The most basic approach to LOB estimation is based on the previous observation 
concerning the phase relationships among the signals received by the array elements. 
If the impinging wave is planar—and this assumption was challenged at HF for 
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Figure 9.1 Geometry of line-of-bearing estimation 


many years—the direction of travel for this wave can be specified by a direction vec¬ 
tor in space known as the Poynting vector. Then, the measured phase seen at one 
element relative to another is determined bv the distance between these elements, 
projected onto the Poynting vector associated with the wave and the wavelength. 
These phase relationships can be used to estimate the direction of the Poynting vec¬ 
tor and hence the direction of the signal source relative to the array, also known as 
the LOB. 

In the simplest two-dimensional case, consider two elements a distance D apart, 
equally spaced along the x-axis. Phase measurements will be referenced to the array 
origin, a distance D /2 from each element. Describe the LOB vector by the angle 6 
it makes with the y-axis. This angle corresponds to the angle between the LOB and a 
reference direction that is by convention normal to the linear array. This geometry is 
shown in figure 9.1. 

It should be clear that the projected distance determining the phase between 
these elements is DsinO, and the corresponding phase difference, in radians, is 
(2tcD/X) sin 0, where X is the wavelength of the impinging wave front. The phase 
of the signal seen at the element at distance D/2 from the origin is early relative to the 
phase seen at the origin; by convention, this is a negative value. 

The projection can also be determined by considering the Poynting vector as an 
algebraic column unit vector u= [— sin 6 — cosd} 7 and the element locations in a 
matrix V e with the 77 -th row containing the x-y coordinates of the 77 -th element 

-D/2 0 

+D/2 0 



(9.1) 
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With this notation, the product of the element position matrix and the Poynting vector 
becomes a column vector of element phases 

(2tc/ k)P e u = [(jvD/X) sin (9 — ( tcD/X ) sin@] 7 . (9.2) 


Thus, as above, the phase difference seen between the two elements is ( 2n/k)Dsind . 

Suppose now that a wave impinges on a linear array of N elements along the x-axis; 
the first of these is designated the reference element. Each element will provide its 
estimate of phase relative to the phase at the reference element, the 72-th such relative 
phase giving rise to the 72 -th element of an observed phase (row) vector £ = {£ n }. 

If the element positions along the x-axis-oriented line array are placed in an ele¬ 
ment position matrix P e , as in equation (9.1), with the 72 -th row containing the x-y 
coordinates of the 72 -th element, 


x\ 0 



xjsi 0 


(9.3) 


the result is an expected phase vector % e = (2jt /X)V e u. An ad hoc approach to LOB 
estimation exploits a simplistic relationship between the expected phases, and the 
observed phases, £: these are expressible as an over-determined set of linear equations 
for the signal direction vector u with solution 


u = -(A/(2^))(P,Pf)- 1 Pf|. (9.4) 

Because the phase noise is not Gaussian, this approach is not rigorously optimal, but 
it can often provide a useful LOB estimate. Note, however, that the assumed array 
geometry places the elements along the x-axis, and as a result the second column of 
the matrix V e is identically zero. This implies an indeterminate solution for the second 
component of the signal direction vector, and thus the first component, — sin 0, is the 
only component available for estimating 6. 

/S. 

This approach provides an LOB 6 relative to a linear array normal, but this angle 
only identifies a cone on which the impinging signal direction may lie. A planar array 
will be necessary to identify an LOB completely. 

Thus, if the problem is viewed in three dimensions and a spherical coordinate sys¬ 
tem is used, the unit vector associated with the impinging signal is modified to be 
u — [— sin 6 cos 0 — sin 6 sin 0 — cos 6] T , and the element location matrix P e is an 
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N x 3 matrix such that the 72-th row contains the x-y-z coordinates of the 72 -th ele¬ 
ment. In this case, the projection becomes P f u. The expected phase vector is again 

— — (2 7r/A.)P*u, and the normal solution for the arrival angle can be determined in 
a similar fashion to the solution in equation (9.4). 

With a two-dimensional array, one can define the array on the x-y plane so that 
as in the linear array case, the z coordinate is zero for all elements. The equivalent 
effect here is that the normal equation solution provides a 2-vector corresponding to 
the estimate for the x and y components of the signal direction vector u (since u has 
unit norm, the z component is then calculable). These two components can be used 
to compute the zenith and azimuth angles, 6 and 0, by noting that n x = sin 0 cos 0, 
and Uy = sin 6 sin 0. Thus, the ratio u.y -r n x = tan 0. This leads to an estimate for 0 
that can be used with the estimate for u x to estimate 0. 

As a result, the azimuth and elevation angles of the impinging wave can be com¬ 
puted using the two components of the vector u. Such interferometric techniques have 
been used for many years. 

An illustrative example is given next for a line array. Consider a thinned, five- 
element array similar to the example introduced in section 8.3.2. Figure 9.2 indicates 
the element locations on the abscissa and the expected phase for a wave arriving at an 
LOB of 20° from array broadside. The estimated phase for each element is plotted as 
the ordinate, assuming the element signal-to-noise ratio is 10, corresponding to a stan¬ 
dard deviation of the phase estimates of 0.15 rad, or about 9°. Using the phase-fitting 
technique described in this section, the estimated LOB for one Monte-Carlo simula¬ 
tion was 19.75°, 0.25° from the true LOB. A100 trial simulation for these conditions 
gave this same result (.25° = 0.0044 rad) as the standard deviation in the error. This 
is only somewhat greater than the .0032 rad given as the Cramer-Rao bound for this 
array computed and plotted in figure 8.3 of chapter 8. 

9.3 Maximum Likelihood Approach to LOB Estimation 

The phase-fitting approach to direction finding is heuristically satisfying, but it has 
limitations in environments where there is multimode propagation and/or cochannel 
interference. A maximum likelihood approach, exploiting the Gaussian statistics of 
typical receiver noise, can be adapted to handle multiple signal environments and in 
the single signal case, has a firm mathematical justification. A frequently used approach 
assumes the signal amplitude has a noise-like character, with a Gaussian distribution. 
Such a signal model is appropriate for communication signals for which the format 
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Figure 9.2 Example of phase fitting, thinned array 

is known, but the unknown data modulation produces uncertainty about the precise 
signal waveform. 

As described in chapter 6, the data model for L observations with an array of TV 
sensors involves an TV x L data matrix Z with columns containing the TV-component 
sample vectors collected at a specific time. Rows represent the time domain data from 
specific elements of the array. If the receiver noise consists of independent, zero-mean, 
Gaussian samples with covariance matrix X, the probability density function for the 
data is given by 

p{ Z|X) = 17T det X |~ L exp{-[(Z -X) // E~ ; (Z - X)]} (9.5) 

= \tt detE| -Z, exp{ —7r[X -i (Z —X)(Z —X)^], (9.6) 

where X = VS is an TV x L matrix representing the model of the signal or signals 
imbedded in the data, leading to the log likelihood function 

lnp(Z|X) = -LNInn - LN\n |X| - Tr[T~ J (Z-X)(Z-X) H ]. (9.7) 

Since the channels are assumed to have independent noise sources, it is usually assumed 
that the covariance matrix is diagonal, with its elements representing the noise level 
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associated with each of the antenna channels. Often the noise is at the same level in 
all channels, in which case the noise covariance matrix becomes an identity matrix 
multiplied by a constant a 2 . 

Now consider the case of a single signal, simplistically modeled as having a ran¬ 
dom amplitude that manifests a circular Gaussian distribution. Recalling that the 
array signal model assumes that the same amplitude variations of the impinging sig¬ 
nal affect all elements of the array, the received vector of element signals z is clearly 
a{t)v( u), where v(u) is defined as the vector of array element responses to a sig¬ 
nal arriving with signal direction vector u. If it is assumed, as discussed earlier, 
that the noise is equal in all channels and independent from channel to channel 
and if the signal to noise power ratio is specified as a parameter P, the covariance 
of the received signal becomes dependent on the arrival direction of the signal, as 
indicated by 

E r (u) = I + Pv(u)v // (u). (9.8) 


Then, the log likelihood ratio is given by 


L 


In p{ Z|u) = —LNlnir — Tindet £,-(u) — z^X r ^(u)z/. (9.9) 

/=1 

The inverse of E r (u) can be shown to be 


E- ; (u)=I- 


P 

NP + 1 


▼( 11 ) 1 ^( 11 ), 


and its determinant to be 


det X r ( u) = NP + 1. 


( 9 . 10 ) 

(9.11) 


This leads to the log likelihood ratio 


L 


\np{Z\u) = -LN In n — L \n(NP +1) — zf (I 


P 


1=1 


NP + 1 


v(u)i^(u))z/. (9.12) 


This log likelihood ratio is maximized when the term 

L 

D ( u ) = Y2 (u)z/ 


(9.13) 
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(9.15) 

1=1 
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(u) ( y z/zf ] v(u) 

(9.16) 

\L=1 / 


y H (ujR^vCu) 

(9.17) 


is maximized. Thus, the maximum likelihood procedure sought in this section can be 
found by evaluating this expression for all possible us and finding the maximum. 

The expression in equation 9.17 has a simple interpretation. Since v(u) is a vector 
corresponding to the complex signal that would be observed at the array of antenna 
elements if it were coming from a direction u, the sum expressed by this equation 
is simply a measure of the power seen by the set of antenna elements, measured by 
the covariance matrix of the observations, compared to the power that would be seen 
if the signal was coming from this specific direction. When a postulated direction 
u matches the observations to the greatest degree, that direction is chosen for the 
direction-finding estimate. 

In many HF applications, the antenna array is based on a linear geometry and the 
search over u demanded by this maximization is one-dimensional, with the azimuth 
associated with u specifying each point in the search. When an antenna array is dis¬ 
tributed over a planar area, there is potential for both azimuth and elevation changes 
in the observations and therefore potential for two-dimensional direction finding. 
The balance of this section addresses such situations and points out the algorithm 
extensions that should be included if such two-dimensional DF is to be accomplished. 

The basic concept is straightforward. One should extend the search for an appro¬ 
priate signal direction from a single dimensional search over the azimuth angle 6 to a 
multidimensional search over a two-dimensional angle vector, usually consisting of two 
components, azimuth and elevation. With this interpretation, the angle 6 becomes a 

2-vector 0, and the search is over a two-dimensional manifold indexed by this vector 
direction. 

Before moving on, it is appropriate to specify the steering vector v(u) relevant to 
the azimuth-elevation search. Just as in the one-dimensional case, the steering vector 
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contains an exponent that represents the phase delay observed at a specific antenna 
element relative to a reference. This phase delay is again proportional to the carrier 
frequency^ and the time delay which is determined by the speed of light c and the 
propagation distance 8. The latter is calculable as a projection of a unit vector in the 
direction of the incoming wave onto a geometric vector from the reference location to 
the antenna element in question. 

If the reference location is chosen as the origin of a spherical coordinate system 
coinciding with the array geometric origin, with element coordinates x,y, and £ for 
the N antenna elements specified as the rows of an N x 3 matrix P e , the set of element 
delays r for a signal arriving from zenith angle 6 (elevation angle c = tc j2 — 6) and 
azimuth (j) becomes 

1 , . 

r = —P,u, (9.18) 

c 

where u is the unit (Poynting) vector in the direction of the incoming wave, 

u— [sin0 cos0 sin 6 sin <fi co sd]. T (9.19) 

The negative sign in equation (9.18) specifies that the wave is incoming, and therefore 
the propagation delay to an element in the wave direction is negative. 

As an example, consider the linear array problem illustrated in the previous section. 
With the search space quantized into 900 equal steps, figure 9.3 shows the value of the 
metric for each step. The maximum value obtained can be interpolated from the data 
points to obtain an maximum likelihood (ML) LOB estimate of 20°, identical to the 
value for the modeled LOB. Additional precision could be obtained by interpolating 
the metric, but this embellishment is not necessary to illustrate the effectiveness of the 
ML method. 

In this example, the number of search space points was seemingly chosen rather 
arbitrarily. In general, this number should be chosen to be commensurate with the 
array beamwidth at the frequency of interest and with the element signal-to-noise 
ratio. The bounds described in the previous chapter can be a guide to choosing it. 
Most conservatively, if the Cramer-Rao bound suggests that errors for a linear array 
could be as low as a beamwidths, one might test the manifold at this spacing. For 
this example, with a 3A array, the beamwidth is 1/3 rad (20°); then, the Cramer-Rao 
bound of 0.0032 rad (0.18°) suggests that 982 points be used. Here, somewhat fewer 
were used to reflect the beam broadening that occurs near the extremes of the search 
region. 
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9.4 The Vector Space Approach 
9.4,1 The MUSIC Algorithm 

In the previous section, each set of simultaneous samples, or snapshots z, was modeled 
as a vector in N-dimensiorial space. This vector consists of two components, a signal 
component specified by the product of a complex amplitude and a steering vector 
for each arriving signal as well as a noise component contributed by receiver noise. 1 
Throughout this volume, the array will be assumed small enough that the complex 
amplitude of each signal arrives identically at each antenna element. Superimposed on 
this complex amplitude is a phase variation that depends on the signals direction and 
the relative element locations as dictated by equation (6.7). These phase variations are 
captured in the steering vector, which remains fixed for each arriving signal for the 
duration .of the processing interval. It should be emphasized that these characteristics 
of a wave areTrue separately for each such wave impinging on the array. If there are 
several such waves, the antenna elements are excited by the sum of the individual wave 


1 Inaccuracies in the steering vector can contribute to the effective noise as well. 
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contributions. Assuming a linear, coherent receiver, the output will reflect a linear sum 
of the contributions. 

This wave description focuses on the signals exciting the array elements. However, 
the element outputs can, and often do, differ from this ideal situation, especially if 
there are significant electromagnetic differences among the elements. As will be dis¬ 
cussed later, such differences can be accommodated by a calibration process; for the 
present, it will be assumed that all array elements are identical, with identical output 
characteristics as a function of arriving wave direction. 

If the value of a complex sample observed at a particular antenna element is repre¬ 
sented as one coordinate of an N-dimensional complex space, the received vector z is 
represented by a point in this space; successive received vectors z (t) draw out a curve 
therein. For the circumstance where only one signal impinges on the array, the suc¬ 
cessive received vectors change as a result of signal modulation, but the steering vector 
contributions to successive x{t) s do not change. That is, in the signal model for the 
f-th signal, 

z* W = v0k) a k( t) + n(f), (9.20) 

where t) is a sequence of complex scalars representing data modulation samples on 

the £-th signal, and \(0^) is a column N-vector specifying the response of the array 

—> 

elements to a signal arriving from a direction specified by the 2-vector with azimuth 
and elevation as its components. 

An equivalent expression using matrices can be written as 

Z*=V(0*)aJ + N, (9.21) 

where Z/, is an N x L matrix incorporating the vectors z,',(t) as its columns, a 7 is the 
amplitude row vector of length A, and N is an N by L matrix of noise samples. When 
all K signals are included, with the matrix A having rows aj representing amplitudes 
of the impinging signals, the resulting data matrix becomes 

Z-V(0)A + N. (9.22) 

Here the matrix V(0) has rows corresponding to array elements and columns cor¬ 
responding to the arrival angles of the impinging waves; A is an amplitude matrix with 
the f-th row containing the L modulation samples for the £-th signal. The arrival angle 
matrix 0 has K columns of 2-vectors corresponding to the LOBs associated with the 
K signals; the arrival angle is assumed to remain fixed for each signal so that succes¬ 
sive received snapshots share the same steering vector, forcing them to lie along the 
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same line, or in a common one-dimensional complex subspace, in the N-dimensional 
complex subspace established by the sensor-based coordinate system defined earlier. 

When two signals arrive from different directions, each will have a different 
steering vector, and each will contribute to the received signal inside a different 
one-dimensional complex subspace. But the two signals combine additively at each 
element, leading to a total received vector that is the linear combination of two steering 
vectors, with the two signal amplitudes being the coefficients in this linear sum. Such a 
linear combination implies that the sum lies in a two-dimensional complex subspace. 

It should be clear by induction that when K signals impinge on the array, as long 
as K < N, the received signal snapshots will lie in a K-dimensional complex subspace. 

With the tools of an N-dimensional vector subspace in hand, attention can 
shift to a specific direction-finding algorithm that exploits them, the Multiple 
Signal Classification (MUSIC) algorithm developed by Ralph Schmidt in his doctoral 
dissertation [1]. 

The MUSIC technique exploits the subspace concept, using the data matrix Z to 
determine the specific subspace corresponding to the steering vectors of the K arriving 
signals. This data matrix Z includes two time-varying components, the signal modu¬ 
lation embedded in A and the receiver noise in N. Although the signal is not totally 
random (it cannot be so if it is to convey meaning to the intended receiver), the data 
modulation introduces enough uncertainty in the signal to motivate invoking the law 
of large numbers and thus to model the signal as a second-order stochastic process 
characterized by a correlation matrix. Then one can characterize Z using a second- 
order statistical measure, in particular its covariance matrix, to help identify the signal 
subspace introduced earlier. 

This covariance matrix R is computed in accordance with 

Vi = ZZ H =Y(®)AV H (®) + o 2 l, (9.23) 

where A = E{AA^}, the expectation of the outer product of the amplitude matrix 
with itself. 

The focus here is on finding V(0), and the covariance matrix provides a tool to 
do so. Note that the statistical independence of the noise in the receiver channels, 
both temporally and channel to channel, makes the outer product of the noise sample 
matrix N diagonal and representable by a constant multiplied by the identity matrix. 

The format of this expression for the data covariance matrix R immediately suggests 
that eigenanalysis will provide useful tools. Recalling that V(0) is an N x K matrix, 
it is apparent that the product V(0)*4V^ (©) is of rank K <N. Thus, even though 
the noise component of R makes it of rank N, stronger signal contributions relative 
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to the noise contributions make most of the power in the covariance matrix lie in a 
rank K subspace. 

Thus, an eigenanalysis of R will produce a set of N ordered eigenvalues, but K 
of these should be larger than the others. These larger eigenvalues—and their corre¬ 
sponding eigenvectors—can be used to estimate the desired signal subspace of rank 
K\ the smaller eigenvalues, described as “noise” eigenvalues, together with their cor¬ 
responding eigenvectors define the “noise subspace.” As with any two subspaces, each 
defined by all linear combinations of some disjoint set of eigenvectors, these subspaces 
are orthogonal. This observation provides the key to the MUSIC algorithm: 

Any signal vector that lives in the signal subspace determined from the eigenanalysis ofR is orthogonal to 
the noise subspace defined by the noise eigenvectors associated with the noise eigenvalues ofR. 

Based on this principle, the MUSIC algorithm can be detailed [2]: 

1. Compute an estimated covariance matrix R from the N-channel data. 

/\ 

2. Perform an eigenanalysis ofR. 

3. Order the eigenvalues and determine how many of these are small enough to 
be considered “noise” eigenvalues. 

4. Define a noise subspace matrix E c using the associated noise eigenvectors as its 
columns. 

5. Consider all possible steering vectors, much as was done in the previous section. 

6. For each possible steering vector, compute its distance to the noise subspace 
using d = V(®) H E c E?V(®). 

7. Form the MUSIC spectrum, defined as the inverse of this distance. (Using the 
inverse makes the signal vectors, which are orthonormal to the noise subspace, 
stand out as large values in the MUSIC spectrum.) 

8. The desired directions-of-arrival estimates are based on the K largest local 
maxima in this spectrum. 

This algorithm is illustrated with a simple example for the thinned linear array in 
figure 9.4. 

9.4.2 Root-MUSIC 

The MUSIC algorithm (section 9.4.1) has proven to be an effective approach to line- 
of-bearing estimation in the HF sensor environment. Implementation of MUSIC 
presents two computational challenges: eigenanalysis of the sample covariance matrix 
and search over the signal manifold for peaks in the MUSIC spectrum. Root-MUSIC 
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Figure 9.4 Example of MUSIC algorithm, thinned array 

is a variation in the MUSIC algorithm that can significantly reduce the computational 
load associated with the manifold search component. As will be discussed later in this 
section, Root-MUSIC can also be more robust in the face of calibration or data collec¬ 
tion errors. But its use is limited to arrays with a particular element-spacing structure, 
and as a result, its use is not nearly as extensive as is Spectral MUSIC. 

To understand the Root-MUSIC concept, consider an array with uniformly spaced 
elements having spacing d. If the location of the first array element is used as the coor¬ 
dinate frame origin and all elements are deployed along the y-axis, the n -th component 
of the steering vector is, from equation (6.7), exp[i27Tp zz • u/A], where p^ z is the row- 
vector containing the coordinates of the 72 -th element. Thus, the steering vector has a 
particularly simple structure, 

v= [^pP u ^pf } u ••• ^p^u]^. (9.24) 

But the element vectors are all of the form p^ = [0 d{n — 2)] so that the steering 
vector becomes 

y _ j*^— (jksin (jksin 0<2)-l ^—(jk sin ^ ^(— jk sin • (TV—1)"j ^ 

= [z* z 'z 2 
wher e ^r> sinW . 


(9.25) 
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Here, as in Spectral MUSIC, the eigenvalues are used to separate a signal subspace 
specified by the larger K eigenvalues (A s ) and their associated eigenvectors (Ey) from a 
noise subspace based on the smaller (N — K) eigenvalues. Also as in Spectral MUSIC, 
the fact that the steering vectors are orthogonal to the noise eigenvectors becomes 
crucial. As a result of this observation, stated mathematically as 

V H (z) E n = 0, (9.26) 

there are (N — K) constraint equations that lead to the Root-MUSIC algorithm. 

The key to Root-MUSIC is the special form of the steering vector, equation (9.25). 
This special form allows the orthogonality conditions in equation (9.26) to be written 
as polynomials: 


Sk(z) = 





n =1 


k — s T1 ,..., N. 


(9.27) 


The orthogonality conditions in equation (9.26) suggest that the desired signal direc¬ 
tions are the roots of these polynomials. Thus, with these polynomials in hand, a 
Root-MUSIC metric D(z ) can be formed: 

N 

D(z) — J2 S k (z)S* k (l/z*), (9.28) 

k—s-\r\ 


and the Root-MUSIC algorithm becomes a search for the set of z values such that 
D(z) = 0. 


From the complete set of roots, the signal directions can be determined by 
evaluating which of the roots have approximate unit magnitude; that is, \z\ ^ 1. 

This discussion of Root-MUSIC closes with a brief introduction to variations in 
the basic algorithm that allows it to be used with unequally spaced elements, or with 
two-dimensional arrays. In the most general case, if the / 2 -th array element is located at 
coordinates p” = [p nx p ny 0], recall that the steering vector for a signal in the direction 
specified by the spherical coordinate angles (0, <fi) is given by a vector v with the / 2 -th 
component 


7 . _ 0 —jKpnx sin 0 cos p+pny sin 6 sin p) 

u n — tT 


(9.29) 


But it has been shown several times in this chapter that after eigenanalysis of the covari¬ 
ance matrix such a steering vector is orthogonal to the noise eigenvectors. With this 
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requirement, it can be observed that 


N 

^ ' h~>nkVn ~ Q? (9.30) 

n=l 

where ^ is the £-th eigenvector, and k is chosen to correspond with a noise eigenvalue. 
With two such noise eigenvectors, two such equations can be obtained. These two 
equations can be solved numerically for the two unknown angles 0 and qb. The search 
for a solution is constrained to the unit sphere specified by the domain of the two 
unknown angles. When there is more than a single signal, more noise vectors are 
needed, two for each unknown signal, increasing the dimensionality of the search, but 
the approach is the same. No detailed results will be given, but the algorithmic concept 
should be evident. The interested reader might consult the reference [3]. 

It should be evident that this approach exchanges a manifold search involving the 
signal directions for a multidimensional search for the solution to the multiple variable 
equations in equation (9.30). Specifics of a particular application would be needed 
to compare the computational costs, a topic that is beyond the scope of this work. 
However, the author is unaware of any application of these methods in the HF receiver- 
processing world. 

9.5 Direction Finding with Polarization-Sensitive Arrays 
9.5.1 Motivation 

Earlier in this chapter, the MUSIC algorithm was described under the assumption that 
all array elements share a common polarization characteristic. If this is not the case, 
appropriate processing can exploit this array feature and potentially improve array 
performance. When faced with signals having different polarization signatures, per¬ 
formance enhancements include increased signal discrimination capability and more 
precise direction-finding performance. 

Polarization-based processing [12] extends the definition of an array response vector 
to include characterization of an array s response to signals as a function of their polar¬ 
ization state. By convention, the polarization state of a plane wave signal is described 
in terms of its horizontal and vertical electric field vector components. The horizontal 
electric field component is perpendicular to the plane of incidence defined by a verti¬ 
cal plane at a given azimuth angle <p; thus, this horizontal electric field component is 
often identified as the ^-component. Recalling that the Poynting vector u lies in this 
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incidence plane and that the electric field rotates in a plane orthogonal to it, the verti¬ 
cal electric field component lies in the plane of incidence orthogonal to the Poynting 
vector. Because the zenith angle 6 is measured in this vertical incidence plane, this elec¬ 
tric field component is identified as the 0-component. Note that although this electric 
field component lies in the vertical plane, it is only vertical (parallel to the £-axis) 
when 6 = 7t/2. 

A vertically polarized antenna will only sense the vertical component of the wave, 
and a horizontally polarized antenna will only sense the horizontal component. 
Because most HF signals are carried by waves with elliptical polarization; that is, 
with phase-related vertical and horizontal components, an array of vertically polar¬ 
ized antennas, such as vertical monopoles, will only observe the vertical component 
of the arriving wave. Although the elements will be able to capture at least some por¬ 
tion of the wave energy, the energy carried by the polarization component of the wave 
orthogonal to the antenna polarization will be missed. If the array antennas are iden¬ 
tical, they will all capture the same energy fraction (to the extent that the wave front 
does not vary across the array). 

When characterizing the array response vector for polarized waves, the vector must 
include a factor representing its response as a function of the incoming wave polar¬ 
ization. Should this factor be common to all antenna elements, it will be common to 
all components of the array response vector. It can therefore be factored out, and even 
though this factor will affect the fraction of the incoming wave energy captured by 
the antenna elements, this factor can essentially be ignored in any direction-finding 
array-processing algorithm. 

If all array elements are identical and dual polarized, with separate output ports for 
each of the two orthogonal polarizations, each element will capture both polarization 
components of the incoming wave, making the elements more efficient. If these ele¬ 
ments are all physically oriented in the same direction, their element patterns (response 
as a function of impinging signal direction u) will be identical and can be factored out 
of the total array response, leaving the element phase center displacements as the only 
contributor to direction-of-arrival determination. 

Now if the array elements are not identical but instead manifest diverse polariza¬ 
tion characteristics, each element will respond differently to an incoming wave. Thus, 
the array response vector components will not only include the element-to-element 
time delay component characterizing identical element arrays but will also include 
a polarization-sensitive factor that is different from one element to the next. This 
factor will not factor out of the array response vector as it would with identical ele¬ 
ments. As a result, the polarization sensitivity should be included in the design of an 
array-processing technique. 
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Most land-based HF antenna arrays fielded to date have included only identical 
vertical antenna elements. Shipboard arrays may be outfitted with identical elements, 
but their proximity to structural components of the host ship give them a polarization 
characteristic that is not only sensitive to both polarizations of an arriving wave but also 
different from one element to the next, perhaps in unpredictable ways. Once such an 
array is installed, it is possible to develop a polarization-sensitive calibration through 
measurement or electromagnetic modeling and to incorporate the element-specific 
polarization characteristic in the array steering vector. Because, as suggested earlier, 
polarization-sensitive elements have the potential for performance improvements, it is 
worthwhile to understand those opportunities available if a polarization observation 
capability is implemented in the processing. 

9.5.2 Polarization-Sensitive DF Algorithm 

Polarization exploitation can best be described as an extension of the steering vec¬ 
tor concept, making a steering vector sensitive to the polarization, as well as the 
direction, of the arriving wave. Because the polarization of a wave can be charac¬ 
terized by its vertical and horizontal components, it is appropriate to define two 
steering vectors to describe an array response, characterizing the response to a hor¬ 
izontally polarized wave and v v characterizing the response to a vertically polarized 

—> 

wave. This wave is characterized by its angle of arrival 6 as well as by a 2-vector 

that specifies the relative strengths of its two polarization components, k p . Since 

an arbitrarily polarized arriving wave can be described as a linear combination of 

horizontal and vertical waves, the response of the array to a wave arriving from a 
—> —* 

direction 9 with polarization state k^, can be specified by the N x 2 matrix Vp(9) 
and can be written as a linear combination of v/, and v v , weighted by the polariza¬ 
tion state components. Using matrix notation, this array response can be written as 

V^k^, 

b(6,k p )=Vp(6)kp, (9.31) 

where k^ is the two-component vector 2 describing the polarization state of the 

incoming wave, and Yp(9) is the N x 2 matrix [v/, v v ]. 

The basic approach to incorporating polarization in a direction-finding algorithm 
extends the response of the array to include polarization sensitivities and to incorpo¬ 
rate these sensitivities in the measure of fit associated with the particular algorithm. 

2 k is a two-component column vector, with the first component proportional to the E# (horizontal) component of 
the arriving wave and the second proportional to the (vertical) component. 
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Then, because the polarization state of the impinging wave is generally unknown, its 
polarization becomes a nuisance parameter that must be optimized over as a step in the 
appropriate spatial spectrum computation. Because MUSIC has proven to be a very 
effective direction-finding approach at HF, its extensions for polarization-dependent 
arrays will be illustrated here. 

Recall that the MUSIC spectrum was computed using the metric 

d(6) = V(6) H E c EfV(6), . (9.32) 


where E c is a matrix comprised of the noise (least significant) eigenvectors of the covari¬ 
ance matrix, and d(9 ) represents a distance from the signal subspace, resulting in the 
MUSIC spectrum 

Smu(9) = (9.33) 

If polarization characteristics are included in the steering vector and the aforemen¬ 
tioned minimization process is included, the resulting distance is 


dMUP(d ) — min 

k P 


i$V p (O) H E c E?V p 0)k p 


(9.34) 


Here, the specification for the steering vector has been modified to become the N x 2 
matrix Vp(0) = [v^ v*,], which includes the dependence on signal polarization, and S 0 
is the noise covariance matrix associated with the array, normally assumed to be <r 2 I. 

With this revised computation for the distance from the noise subspace, equa¬ 
tion (9.33) can be evaluated as before, leading to a polarization-sensitive MUSIC 
spectrum as detailed shortly. 

Note that the expression for cImup is a ratio of quadratic forms, a so-called Rayleigh 
quotient generally defined by a relation of the form 


x^Ax 


(9.35) 


for which the minimum is well known to be the smallest eigenvalue of A. This min¬ 
imum occurs when x is the eigenvector corresponding to this minimum eigenvalue. 
This can be stated as the relationship 


Ax 


min 


A 




(9.36) 


so that 'Kmin must be the smallest root of the equation. 


det(A — AI) = 0. 


(9.37) 
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Putting equation (9.34) into this form results in a search for the minimum root of 

detfV/^ErEfV/d) - XV p (6) H S; l Y p 0) } = 0 . ' (9.38) 

This minimum root becomes the distance to the diverse polarization noise subspace, 
^MUP 3 so that 

Smup 0) — \dMUp(&)\ 1 - (9.39) 

Thus, the methodology for direction finding with a polarization-sensitive array can be 
summarized as follows: 

• Assume that there are array manifold specifications for the antenna array 
corresponding to two orthogonal polarizations. 

• For each arrival angle in these array manifold specifications, set up the MUSIC 
spectrum specified in equation (9.34). 

• Introduce a change of coordinates, k r = (V^Vp)^ 2 kp, to put the argument of 
the equation (9.34) minimization in the form of equation (9.35) with the matrix 
A r = VfE c EfV r , where V r = Y p (Y^Y p r l/2 . 

• Eigen analyze A to find the smallest eigenvalue and the corresponding eigenvec¬ 
tor. This eigenvector k r provides a route to the desired best-fit polarization for 
an arriving signal at the specified arrival angle. 

• Undertake a one-dimensional manifold search to find the arrival angle corre¬ 
sponding to the peak of the MUSIC spectrum, equation (9.39), associated with 
the minimizing polarization state. Note that this procedure avoids a multidi¬ 
mensional search that would be needed if search over both polarization state (two 
degrees of freedom) and arrival angle (one or two degrees of freedom) were used. 

• The angle of arrival associated with the two-dimensional steering vector (array 
manifold entries for each of the two orthogonal polarizations) and the polar¬ 
ization state (i.e., the eigenvector) corresponding to the minimization for this 
arrival angle specifies the sought-for LOB and polarization; the second of these 
may not be needed by the user, except as a means to obtain the angle of arrival. 

9.5.3 Vector Sensor Example 

To illustrate the direction-finding approach for a polarization-sensitive array, consider 
a three-element structure comprised of three colocated antennas; that is, all three ele¬ 
ments have a common phase center. Such a structure is often classified as a vector 
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sensor, although that term most frequently refers to a six-element structure with three 
of the elements consisting of orthogonal dipoles and three more elements based on 
orthogonal loops. Here, to simplify the description while conveying the vector sensor 
concept, only three elements are used. 

The first element is a vertical dipole, with feed point at the origin of a conventional 
rectangular x—y — z coordinate frame, responding only to the vertical component of 
the impinging electric field. An arriving wave with vertical polarization has, by defini¬ 
tion, its electric field vector in the vertical plane, the same plane in which the vertical 
dipole lies. The element response can be described as the projection of the electric field 
onto the dipole axis. Thus, the response is E 0 sin 9, where E 0 is the magnitude of the 
electric field, and 9 is the zenith angle describing the arriving signal direction. Note, for 
example, that the dipole response is zero if the wave is arriving from overhead (9 = 0) 
and E 0 if the wave is arriving in the horizontal plane (9 =jt/2). If the impinging wave 
is horizontally polarized, the electric field is perpendicular to the vertical plane passing 
through the z-axis and making an angle 0 with the x-axis. This element lies in this 
plane and thus does not respond to this wave. 

The second element is a vertical loop in they — £ plane with its center at the coordi¬ 
nate frame origin; such an element responds only to the x-component of the magnetic 
field. If the arriving wave is vertically polarized, the magnetic field is perpendicular 
to the vertical plane passing through the z-axis at an angle 0 with the x-axis. Its x- 
component is —H 0 sin 0, and its y-component is +H 0 cos 0. If the impinging wave is 
horizontally polarized, its magnetic field is in the vertical plane perpendicular to the 
wave direction, which is at an angle 9 to the z-axis; thus, the component perpendicular 
to the y-z plane, in the x-direction, is H 0 cos 9 cos 0. 

Finally, the third element is also a vertical loop but is oriented in the x — z plane 
with its center at the origin; this element only responds to the y-component of the 
magnetic field. If the arriving wave is vertically polarized, the magnetic field is hor¬ 
izontal, perpendicular to the vertical plane passing through the £-axis at an angle 0 
with the x-axis. Its y-component is +H 0 cos0. If this wave is horizontally polarized, 
the magnetic field is in this vertical plane, and its y-component is +H 0 cos 6 sin 0. 

These relationships can be summarized with the 3x2 response matrix 


V 


P 


sin 9 

l 


0 


— ~ sin 0 -J- cos 9 cos 0 

Vo wo 


, — COS 0 — 

L Vo Vo 


cos 9 sin 0 


(9.40) 


where rj 0 = is the characteristic impedance of free space. 
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Sample number 


Figure 9.5 Vector sensor example data—simulated signals are real 

Now consider an 11 MHz signal arriving from an azimuth of 30° and an elevation of 
20°. Assume it is sampled at 50 Msps for 100 samples, corresponding to a 0.5 fis data 
collection interval, at each element of this simple vector sensor. Note that the vector 
sensor elements are all phase aligned, and the simulated signals are real; as a result, the 
data is real. Figure 9.5 shows the signals seen at the three elements. 

The 100 samples of data are collected and used to form a covariance matrix in the 
usual way. Because there are only three elements in this vector sensor, the data space 
has dimension three, and therefore there are three eigenvectors to span the data space. 
As discussed earlier in this section, the polarization diverse direction-finding process 
requires two eigenvectors to characterize the signal subspace. Since there are only 
three eigenvectors to span the entire data space, a single eigenvector defines the noise 
subspace. This is the vector corresponding to the smallest eigenvalue. This eigenvec¬ 
tor is used in the polarization diverse MUSIC algorithm, and the resulting MUSIC 
spectrum is shown in figure 9.6. It is clear that such processing of the vector sensor 
data produces a LOB aligned with the direction of the impinging signal. As can be 
expected with a minimal aperture, the accuracy obtained from the vector sensor data 
is limited. 
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Figure 9.6 Polarization diverse MUSIC processing results for example vector sensor- 
decibel color scale 

9.6 Feature-Based Methods 

All previous sections of this chapter covering direction-finding methods made no 
assumptions about the signal modulation. This section deviates from that assumption 
and focuses on techniques that do consider how the signal is modulated and through 
this modulation invest the signal with identifiable features that can be exploited to 
enhance direction-finding performance. Because the signal of interest possesses a well- 
defined “feature,” appropriate signal processing can de-emphasize signal energy that 
does not manifest this feature and thereby improve performance on the desired sig¬ 
nal. This section will describe two approaches to feature-based direction finding. One 
approach is based on observed changes in the signal structure spectral or temporal 
characteristics; the other is based on temporal periodicities in the signal structure. 

9.6.1 Change Detection Approaches 

As discussed at length in chapter 7, which is dedicated to a variety of detection meth¬ 
ods, it is appropriate to review that material with a description of direction-finding 
methods based on sudden changes in the signal structure. Such changes can occur 
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in either the spectral or temporal domain. The initial discussion here will focus on 
the spectral domain, but later its adaptation to the temporal domain will be briefly 
outlined. . 

As discussed in detail in chapter 4, most HF wideband array-based receiving systems 
collect a continuous stream of wideband data from each antenna element, digitizing 
all such streams at a high sample rate. Because the HF environment is populated with 
many narrowband signals—and some wideband signals as well—such receivers utilize 
spectral analysis methods to frequency-channelize these streams into many narrow- 
band cells. These cells are normally contiguous and jointly cover the spectral region of 
interest to the receiving systems users. The frequency channelization process is usually 
implemented using fast Fourier transform (FFT) techniques and reduces the sample 
rate in each resulting narrowband cell in accordance with the cell bandwidth. 

This process can be crudely 3 described by considering a block of L samples for each 
of the N antenna elements, organized as a data matrix Z with N rows and L columns. 
The spectral analysis process involves subdividing the rows of this matrix into blocks 
of length K; then, a length-TT, windowed FFT is performed on each block, producing 
a sample in each of the K FFT cells. The original data block of length L produces 
M — L/K samples in each FFT cell at a sample rate that is l/K of the original sample 
rate. Details of this spectral analysis process were covered in chapter 4. Suffice it to 
say that the result of this spectral analysis process is a data cube with dimensions N 
(antenna) by K (frequency cell) by M (time). Consider this cube to be comprised of 
K,N x M data planes (matrices). The k-th plane in this cube is an N x M data matrix 
Z^, with rows corresponding to narrowband samples from a specific antenna element. 

The present goals are to examine the set of narrowband frequency cells generated 
by this FFT process and to determine which cells contain distinct signals, then to 
estimate their steering vectors and from these, their directions of arrival. Consider 
scanning the frequency cells in order, from low to high. Assuming that cell activity is 
characterized by a Gaussian process, one can characterize the signal activity in the £-th 
cell by a covariance matrix computed from a series of M FFT output samples from 
that cell using 

1 M 

(9 - 41) 

i—1 


3 A more detailed description would describe overlap and windowing features designed to spectrally isolate 
adjacent cells from each other. 
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The scanning process proceeds by characterizing the first cell in the band of interest 
using equation (9.41) and then asking whether the second cell appears to have the 
same character. Such a question can be addressed quantitatively [13] by computing the 
probability that samples from this test (second) cell, Z 2 , arise from the same Gaussian 
process that exists in the first (reference) cell. This probability is 

exp 

Now whiten the test data using the covariance matrix from the first, or reference, cell 
to obtain 

= (9.43) 

Then, it is straightforward to show that the covariance of the whitened data is the 
identity matrix, I, and 

p\(Z w ;Z 0 ) = —j^exp[-Z w Z^]. (9.44) 

Equation (9.44) can be used as a test statistic to determine if this test cell has the same, 
or a different, signal characteristic as the reference cell. If the test shows the test cell 
has a different signal characteristic, one can investigate the steering vector for the new 
signal in the following manner. 

Because the frequency cells are intended to be very narrow, it is appropriate to 
assume there is only one new signal in a test cell. But in whitened coordinates, the 
reference cell manifests an identity covariance matrix. Thus, a frequency cell that 
appears different from the reference cell can be modeled with a covariance £/ = 
I + P neuNnevN^ eW ‘ Eigenanalysis of this covariance produces a maximum eigenvalue 
of 1 + P neuN^ew^new* with its corresponding eigenvector providing the steering vector 
estimate V new . 

Finally, recalling that this covariance matrix model is in whitened coordinates, one 
must return to native coordinates to find the appropriate new signal steering vector. 
That is, 

V= Zfv new . (9.45) 

It is appropriate to note that the whitening expressed by equation (9.43) leads to a 
useful relationship between the covariance computed from the data in the reference 
cell and the covariance computed in the test cell. In particular, the data in the test cell, 


pl(^2 fe) • * * ^2ipM )5 £*) = 


•TWij 


M 

(u)x; l Z2(td . 

i=1 


(9.42) 
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after whitening, has the covariance matrix 

z. = E K ,l2 ZiZrX;‘' 2 ' (9.«) 

77Z 

= K 1/2 ZiK 1/2 ’ (9.47) 

which has the same eigenvalues as the matrix Y* c = E'^Ey, where Ey is the covariance 
computed in the test cell before whitening.^ 

Thus, the eigenvalues of this matrix E c can be used to assess the probability of the 
samples in the test cell coming from the same distribution as those in the reference 
cell. For this reason, E c is often identified as a change matrix . 

The steering vector identified in equation (9.45) can be used to determine the direc¬ 
tion of arrival of the signal in the test cell signal using the methods described earlier in 
this chapter. 

The process just described for assessing any difference between the signal environ¬ 
ment in the second frequency cell and that in the first frequency cell can be continued 
with all K frequency cells in the captured environment. A covariance matrix in a ref¬ 
erence cell is computed to characterize that cell; then samples from the adjacent cell 
are assessed using the processing described to determine whether the environments in 
the two cells are similar or different. 

This same approach can be used to detect and determine the direction of arrival 
associated with an intermittent signal. In this situation, the timeline is segmented 
into short intervals; a spatial covariance matrix is computed for the first interval, and 
the characteristics of the second interval are assessed using equation (9.42), just as was 
done with the first and second frequency cell. If there is a significant difference, the start 
of a new signal is declared and the steering vector estimated, as was done in frequency 
domain processing. The process continues interval by interval, looking to determine 
any new signal starts as well as the direction of arrival associated with such a new signal. 

9.6.2 Cyclostationary Feature Approaches 

Although, direction-finding techniques can be based on a variety of signal features, 
this subsection will focus on one particular feature, so-called cyclostationary signals 
displaying cyclic, or time-periodic, features [14]. Recall the definition of a stationary 

4 This is the result of the so-called cyclic permutation lemma, easily proved by noting that the eigenvalues of the 
product matrix AB, computed from the equation ABx = Ax, also satisfy, after multiplication by B, the 
eigenequation BABx = A.Bx, or BA(Bx) = k(Bx) with the same eigenvalue. 
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random process; such a process manifests statistics that are independent of the observa¬ 
tion time. Many processes encountered in communications are not, strictly speaking, 
stationary because they involve time-varying structures, such as periodic components. 
For example, because a binary-phase-shift-keyed (BPSK) waveform is constant for a 
symbol duration, the joint probability for the waveform values at two specified times, 
which are less than a symbol duration apart, will depend upon where in the waveform 
the first time is located. However, such a nonstationary waveform can be cyclosta¬ 
tionary because there is a periodicity frequency a for which the cyclic self-correlation 
function 



lim — 

T-> oo T 


f T1^ •£ •£ 

’ R x (t- 1—j t -) cxp{—j2rcat)dt 

-r/2 2 2 


(9.48) 


is not identically equal to zero. Here R x (t, u ) = E[x(t), x*(u)]? 

This cyclic self-correlation function has an important property that leads to a useful 
algorithm [15]: 

Lemma 1. Ifx{t) is a cyclostationary process with a self-cyclic correlation function Rf (r), 
and ify(f) = x{t + T ), then 


R^(r) —R%(r) exp(j2naT). (9.49) 

Because the differences in signal observations from one antenna element to another 
are primarily due to delays proportional to angle of arrival, this can be exploited to 
provide an estimate of signal source direction. 

Recalling that an observation at the i — th antenna element of a linear array with 
elements along the y-axis at locations {D t } is modeled as 


5 

*i(t) — s^t + Dj sin (j>k/c) + ni , i — 1,2, (9.50) 

k—\ 

the cyclic correlation at the i — th element is 



IS S 

( J2 skit + r / 2 + D - sin ^/c) ■ Yi Hf- 

\ k=\ 1=1 


r/2 + Dj sin <pi/c)e 


—jin at 


(9.51) 


5 The asterisk denotes complex conjugation. 
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Note that the noise terms have dropped out because the noise and source signals 
are assumed to be cyclically uncorrelated with each other. Further, if it is assumed that 
the source signals are mutually cyclically uncorrelated, this becomes 


Sa 


i^(r) = ^ k {r)^ 2lzaDi An4>i/c . 

k=l 


(9.52) 


Now form two element-based N-vectors, the first having components equal to 
these cyclic correlations for a given t, y(r), and the second equal to the phase delays 
associated with the £-th unknown source arrival angle. 


T 


y(r) = [R XJ (r) R X2 (t) ... R XSa (r)] 

_ ^jlTt clD\ sm{(p k )/c ^IkciDn sm{(j) k )/c J T 


(9.53) 

(9.54) 


The phase delays in equation (9.54) manifest the cycle a as if it were a carrier frequency 
of a narrowband signal. 

If there are S a sources with the cyclic feature a, their cyclic correlation functions 
can be arranged in an S a -vector 

s(t) = [Z?(t) #ty)] r . (9.55) 

Also, the corresponding S a column vectors a^, each with its associated arrival angle 
can be arranged in an N x S a matrix A = [a; a^ — a^], leading to the relation 

y(r)=As(r). (9.56) 

Computing the cyclic correlations at each element for L lags, one can form a “data 
matrix” with the columns corresponding to different lags and the rows to different 
antenna elements: 

Y(a) = [y(0) y(l) ... y(L-l)]. (9.57) 

Note the similarity ofY(a) to the usual data matrix in which each column is a temporal 
snapshot of a signal. Here, each column is the set of cyclic correlations at the different 
antenna elements. 

It should be apparent that Y(a) can be treated like the usual data matrix used for 
MUSIC or, if the array geometry meets the appropriate duality requirement, Esti¬ 
mation of Signal Parameters via Rotational Invariance Techniques (ESPRIT). Here a 
covariance matrix is formed from Y(of), 

K(a)=Y(a)Y(a) H = As(t)s H (t)A h , 


(9.58) 
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showing that the eigenvectors of lZ(a) span the same space spanned by the set of phase 
delay vectors exhibiting the cyclic feature a. As a reminder, this can be seen because 
the matrix product, s H (r)A H , has dimension S a rows by N columns representing a 
set of N weight vectors for the S a columns of A. 

Most importantly, if there are several signals present in the environment but only 
one exhibits the cyclic feature a, 1Z will be a rank-1 matrix, and the eigenvector asso¬ 
ciated with it will be the desired a. The arrival angle can be computed directly using 
the phase-fitting methods of section 9.2. 

Alternatively, as with MUSIC, one can compute a spatial spectrum by determining 
the rank of the signal subspace from the eigenvalues of 1Z and finding the distance 
of all length-N steering vectors a(0) from this subspace. Angles associated with close 
distances to the subspace, measured by peaks in the 1/(distance) 2 measure, correspond 
to the arrival angle of signals having the cyclic feature a. 

9.7 Eigenvector Rotation 

In section 9.4, the MUSIC algorithm was described. This technique rests on the 
concept of a signal subspace—that portion of the data vector space containing the 
signals—which complements the noise subspace containing noise alone. By means of 
eigenanalysis, the two subspaces can be identified, each characterized by a different set 
of eigenvectors, the signal and noise eigenvectors, respectively. The eigenvalues of the 
data covariance matrix can be used to determine the number of eigenvectors associ¬ 
ated with the signal subspace. Eigenvector rotation methods [7, 9, 8, 10] exploit the 
structure in the data vector space and particularly, this observation: because the signal 
eigenvectors span the signal subspace, they can be used to represent all of the signal 
steering vectors. Further, the specific eigenvector rotation approach proposed by Vez- 
zosi [7] assumes that each signal affects each array element with the same amplitude. 
This latter assumption may not always be valid at HF, leading to a de-emphasis of this 
technique in the toolbox of direction-finding techniques for the FIF environment. 
Nevertheless, it will be briefly described. 

Assuming that there are K signals, one can generate a model for the data covariance 
Rs based on a set of K steering vectors and an equal noise variance at each sensor or 2 , 

R z =APA^ + or 2 I. (9.59) 

If the noise model is different from one with independent and equal noise samples, 
appropriate modifications can be included. 
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Here A is an N x K matrix with columns containing the steering vectors of the K 
signals; P is a K x K source correlation matrix containing the source powers on its 
diagonal, and any source-to-source correlations in its off-diagonal, elements. 

It is straightforward to show that an eigenanalysis of R produces an alternative 
description, 

R z = E S E? + all, (9.60) 

in which the eigenvectors {e} and eigenvalues {A,} of are used to define the matrix Ey. 



(9.61) 


Further, the K x K correlation matrix P can be expressed in terms of the signal eigen¬ 
values of R^, through an eigendecomposition of P, by P = UAU^, where A is a 
diagonal matrix of the eigenvalues of R z , and U is a unitary matrix (UU^ = I); both 
A and U are K x K. 

Then, if one defines D = AUA i/2 , DD H = E S E?. This can also be seen by noting 
that since the K eigenvectors corresponding to the K largest eigenvalues span the signal 
subspace, the source vectors can each be rewritten as a different linear combination of 
these K eigenvectors. This immediately implies that the set of K source vectors can be 
written in the form D = E^B, where B is a K x K unitary matrix. The rows of column k 
of B provide the coefficients for the linear combination associated with the £-th signal. 

If the sources are uncorrelated, U = I, and A 1 / 2 = diag{p\, ...,<7^). Then, 
D=AA ; / 2 ,and 

D= [dj d 2 (9.62) 

where d k — cr^k- 

This suggests the notion of an effective steering vector, equal to the actual steering 
vector scaled by the signal eigenvalue 

As mentioned previously, Vezzosi [7] also observed that for arrays of practical size, 
a specific signal impinging on an array may not vary significantly in amplitude across 
the array elements. Under these conditions, all matrix elements in a specific column 
of D will have almost the same magnitude. Based on this observation, one can choose 
the elements of D, minimizing the variance of the elements in a column: 

KM 

/*=Xi!7i^i 2 -^) 2 ’ (9 - 63) 

k=\ 7=1 


where is the mean of the elements in column k. 
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For K — 2, Vezzosi showed that 



(9.64) 


For K > 2 Webster [11] described a multipass approach taking the eigenvectors in pairs 
and iterating. Alternatively, one can develop an iterative approach using the Matlab 
optimization toolbox. 

Once a solution for the matrix D provides the effective steering vectors, the esti¬ 
mated steering vectors can be obtained by rescaling using the signal eigenvalues. 
Finally, the desired source directions can be extracted from the components of the 
using the modeled phase relationships among the array elements. Such an approach is 
similar to the phase-fitting method described early in this chapter (section 9.2). 
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Geolocation Techniques 


10.1 Overview 

In many HF system applications, the goal extends beyond that of direction finding, in 
which estimating the line of bearing associated with an arriving signal is primary. Most 
often, the ultimate goal is to utilize two or more such DF results from well-separated 
sensors in order to estimate the actual transmitter location. Usually, for HF emitters it 
may safely be assumed that the transmitter is ground-based, leading to a requirement 
for only two sensors, each contributing an azimuthal estimate on a common signal. If 
the analyst has the luxury of more than two sensors, the overdetermined measurements 
allow a reduction in the geolocation estimate error. This approach to geolocation is 
covered in section 10.2 

Alternatively, there may be only one sensor available. If an azimuthal estimate is the 
only sensor output, then the intersection of the plane through the sensor with the Earth 
is a curve on the Earths surface. The analyst needs to identify a point on this curve 
somehow. Candidate measurement techniques leading to interesting, though perhaps 
not acceptable, positional results include 1) coupling an estimate of the arriving sig¬ 
nal elevation angle with ionospheric'mode predictions using ray-tracing techniques, 
and 2) measuring the differential delay associated with two separable multipath com¬ 
ponents at different elevation angles and a common azimuth. These techniques are 
covered in sections 10.3 and 10.4, respectively. Additionally, geolocation estimates can 
be derived using simultaneous differential range estimates from two pairs of cooperat¬ 
ing sensors, either two independent pairs or three sensors operating as two pairs with 
a common element. With this approach, the signal format is an important factor in 
positional accuracy as it drives the range measurement accuracy of each sensor. This 
approach is covered in section 10.5. 
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10.2 Multisensor Angle-Only Geolocation 

This section focuses on the utilization of azimuth-only estimates from two or more 
sensors to arrive at a geolocation result, an estimate of the latitude and longitude of 
a signal source. The primary concern here is the mathematics of the geolocation pro¬ 
cess. Important issues such as whether every one of the multiple azimuth estimates are 
associated with the same signal or whether any of the received signals arrive from the 
source as a result of refraction from a tilted ionosphere instead of the more normal 
spherical, Earth-centered, ionospheric model, are beyond the scope of this book. 

10.2.1 Coordinate Systems 

It is useful to define several coordinate systems [1]. One important coordinate system is 
an Earth-fixed geocentric system with its origin at the center of the Earth, the x-y plane 
corresponding to the equatorial plane, and the z-axis aligned with the Earths axis of 
rotation. A second coordinate system is a sensor- or target-based topocentric coordinate 
frame with its origin on the surface of the Earth, its x-axis to the east, y-axis to the 
north, and z-axis toward the zenith. 

A third intermediate coordinate frame has all axes parallel to its associated topocen¬ 
tric frame but with its origin displaced to the Earth center. Thus, the topocentric- 
geocentric pair of frames will describe the same point with equal x- and y-coordinates 
but a z-coordinate that is displaced by the Earth radius at the appropriate latitude and 
longitude. 

This intermediate coordinate frame, because it is related to other geocentric frames 
by a pure rotation, facilitates those transformations necessary for working with the 
usual Earth-fixed, geocentric coordinate system in which latitude and longitude are 
specified. These transformations will be described later. 

10.2.2 Sensor Azimuthal Measurements 

Measurements are generally made in a topocentric coordinate frame specified by the 
sensor latitude and longitude. Assume that there are J sensors located at latitude Vj 
and longitude Xj. Each makes an azimuth-only measurement <pj. The measurement at 
the^-th sensor defines a plane passing through the local sensor-based z-axis. 

This plane intersects the Earths surface in a great circle, and it will be assumed that 
the signal source measured to be at this azimuth lies somewhere on this circle. If there 
are two sensors, each making an azimuth measurement, there will be two great circles 
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that intersect at two points. The signal source is located, within measurement error, at 
one of them. 

If there are more than two sensors, say, J of them, each making an azimuth mea¬ 
surement, there will be/ great circles. Then, the geolocation result might be the point 
on the Earth s surface that minimizes the mean square distance to the set of great cir¬ 
cles. A useful alternative might involve a weighting of the azimuth measurements if 
they differ in quality. Further options could involve appropriate clustering processes. 
Although details for the more elaborate ways to combine multiple measurements to 
obtain a composite result are beyond the scope of this book, the reader should be aware 
of the possibilities. 

One procedure is to specify the equation of the plane corresponding to the azimuth 
measurement in the measuring sensors coordinate frame. Then, for each sensor, the 
intersection of this plane with the Earths surface can be computed numerically. This 
three-dimensional curve, or trace, 1 can be transformed to a common geocentric frame, 
usually the standard equatorial frame in which the sensor s latitude and longitude are 
specified. 

Now for the details. In they-th sensor coordinate frame, the equation for the plane 
corresponding to an azimuth measurement <pj is given by 

yj sin 4>j — Xj cos <pj = 0. (10.1) 

The intersection of this plane with the Earths surface, which because of its imperfect 
spherical shape is often described numerically, is a curve in three-space. This curve is 
a trace on the Earths surface. The computation can be visualized using the geometry 
shown in figure 10.1. A set of x values start the process; for each x value, the corre¬ 
sponding y value in the estimated azimuth plane is computed from equation (10.1). 
Then, the corresponding z value is determined from the numerical Earth model in site 
coordinates. 

If the Earth could be regarded as a perfect sphere of radius R e , this last step is 
simplified. First, the Earth centric angle 8 is computed from tan 8 — (x 2 +y 2 ) l ^ 2 /R e ; 
then the required z values are found from cos 8 — ( R e + z)/R e (note that z is negative 
in this relation), or £ = —R e { 1 — cos 8). 

After appropriate coordinate transformations to equatorial coordinates, described 
in the next section, this trace can be used to represent the measurement from a specified 
sensor. With two sensors, their intersection determines the desired geolocation result. 


1 Note that the use of the term “trace” in this section is a graphical term and should not be confused with the 
trace of a matrix. 
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Figure 10.1 Geometry for the intersection of a sensor azimuthal plane and the Earth 
surface 

Figure 10.2 illustrates the concept. The details associated with this figure are covered 
in section 10.2.4. 

A straightforward numerical approach is to generate a matrix representing the dis¬ 
tances between each point of the two traces. Then, the minimum entry in this matrix 
can be found at the desired intersection. Step quantization in the trace computation 
is the primary limit to numerical computation accuracy. Of course, measurement 
accuracy is also a critical factor in geolocation estimate accuracy. 

If there are more than two sensors, it is necessary to find the Earth surface location 
that minimizes the error among the intersecting planes. This minimization can be 
performed as a numerical search using a two-sensor solution as a starting point. A 
candidate geolocation estimate is evaluated by computing the nearest distance from 
this point to each of the sensor Earth intersection traces and summing the squares of 
these distances as an evaluation metric. The candidate geolocation estimate with the 
minimum of this metric is used as the final geolocation solution. 

A more mathematical approach might be based on transforming the equations 
corresponding to the azimuthal measurement planes into equatorial coordinates and 
developing a normal equation solution to the intersection of the multiple planes under 
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Figure 10.2 Global coordinate plot of sensor traces for two-sensor example 


the constraint of lying on the Earth surface, represented by the equation of a sphere 
in equatorial coordinates. However, such an approach does not easily lend itself to 
extensions for oblate spheroidal or numerical Earth models and thus will not be 
pursued here. 

10.2.3 Sensor to Equatorial Coordinate Transformation 


Transforming from sensor coordinates to geocentric coordinates involves the ori¬ 
gin shift from the topocentric sensor system to the intermediate coordinate frame 
described earlier. This is merely a translation of the z-coordinate by the Earth radius 
so that . 


Y*. - y* • 

(10.2) 

yi =yj> 

(10.3) 

Zj — Zj T Re> 

(10.4) 
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Figure 10.3 Coordinate systems associated with multisensor geolocation 


where (x^y^zi) are intermediate coordinates, and {xj,yj,Zj) are the topocentric 
coordinates. 

From the intermediate frame, the transformation to a common equatorial frame 
can be specified by two rotations, the first by an angle equal to the negative of the 
sensor colatitude (n /2 — v) about the intermediate x-axis and the second by an angle 
equal to the negative of the angle (n/2 + X), where X is the sensor longitude, about 
the z-axis. These rotations are sketched out in figure 10.3. The first rotation is specified 
by the matrix 


and the second by 



0 

sin v 
cos v 


0 

— COS V 

sin v 



— cos X 0 

— sin X 0 

0 1 


(10.5) 


( 10 . 6 ) 


The complete translation is then the translation specified by equation (10.4), followed 
by two matrix multiplications using the matrices in equations (10.5) and (10.6). 
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10.2.4 Geolocation Example 


Assume that Sensors A and B are located at latitude-longitude coordinates 
40°N, 70°W and 30°N, 100°W, respectively. Sensor A estimates that a signal arrives 
at an azimuth of —25°, while Sensor B estimates an azimuth of +25°. Figure 10.2 
illustrates the geometry. 

The azimuthal measurement planes for the two sensors, in their local coordinate 
frames, are given in equation (10.7). 


0.9063* + 0.4226jj/ = 0 Sensor A, 
0.9063* — 0.9226j/ = 0 Sensor B. 


(10.7) 


Each of these two planes intersect the Earths surface in a curve or “trace.” After 
coordinate transformations to an equatorial frame, the intersection of these two traces 
results in the desired geolocation estimate. This intersection can be found numeri¬ 
cally from the matrix of trace-point to trace-point distances, which is shown for this 
example as a surface in figure 10.4. The resulting geolocation estimate corresponding 
to the minimum of this matrix is the point 50°N, 85°W. Clearly, there will be some 
error, but that error reflects the. uncertainty associated with the two azimuth mea¬ 
surements. Figure 10.2 illustrates the process with a plot of the sensor traces in global 
coordinates. The result in this example produces an intersection of the two traces at a 
latitude of 53.6° and a longitude of —80.6°. This result can be checked by comput¬ 
ing the azimuths associated with an emitter from these coordinates; these azimuths 
are —25° for Sensor A and +25° for Sensor B, in line with the measured azimuths 
specified in the example description. But in this example with two sensors, this is not 
surprising because the x-y coordinates used to compute the Earth intersection traces 
were constrained to the measured azimuth plane. 

The multisite technique described in this section is a costly approach to the geolo¬ 
cation problem. First, there is the requirement for multiple sensors; their sites must 
be well distributed about the target in order for their intersecting planes to be close 
to perpendicular, and as a result, the error ellipse associated with any measurement 
errors is nearly spherical rather than an elongated ellipse; this issue is often described 
using the term geometric dilution of precision ( GDOP ). Second, there must be appro¬ 
priate communication links to allow the data from the multiple sensors to be brought 
together for processing and generating the desired fix. Third, protocols must be in 
place to assure that the multiple sensor-derived planes are all based on the same target. 
Otherwise, the oft-quoted “garbage in, garbage out” adage applies. 
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Figure 10.4 Trace-point to trace-point distance contour for two-sensor example 

Avoiding these complexities makes it desirable to generate geolocation estimates 
using a single sensor, but the potential for enhanced accuracy using multiple sensors 
may be worth the complexity for some applications. The alternative to a multisite 
approach, using a single site, is described in the next section. 

10.3 Single-Site Geolocation 

Single-site geolocation, often identified as SSL, is one approach to avoiding the costs 
associated with multiple-site geolocation, even though the final result may lack the 
accuracy of multisensor geolocation. And the two methods can be used in concert to 
improve estimation accuracy. 

10.3.1 Basic SSL Ranging 

The basic approach to single-site geolocation has its roots in early experiments by 
Appleton and Barrett [2] (see [4]) designed to prove that the ionosphere exists. These 
experiments were focused on making estimates of the ionosphere s altitude using two 
sites at known locations; thus the range between transmitter and receiver was known. 
In these experiments the goal was to estimate the ionospheric altitude. SSL reverses 
the computation to estimate range using measurements of ionosopheric altitude and 
elevation. In the early experiments, Appleton and Barrett employed the basic principle 
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Figure 10.5 Geometry for single-site ranging 


by measuring the elevation angle of an arriving “atmospheric ray” from a transmitter 
at a-known distance. 

With an elevation estimate, the height of the ionosphere could be inferred by solving 
the geometric problem portrayed in figure 10.5. This diagram suggests that a single sen¬ 
sor makes azimuth and elevation estimates on an arriving signal, or “ray.” The azimuth 
estimate defines a plane of propagation, leading to a great circle path between the trans¬ 
mitter and receiver. The geometry in this azimuthal plane shows the elevation angle 
measured at the sensor, the ionospheric height, and the transmitter location. In the 
Appleton-Barrett experiments, the elevation angle e would be measured. The Earth- 
centered angle would be known from the transmitter-receiver separation distance, and 
the triangle could be solved for the distance from Earth center to the ionosphere. 
Subtracting the Earth radius would provide the desired ionospheric height. 

Alternatively, if the ionospheric height were known, the transmitter-receiver dis¬ 
tance could have been found from an elevation measurement and the ionospheric 
height. There is no evidence that Appleton and Barrett made such calculations; this 
potential step-forward was experimentally explored by Treharne [4] in 1958—1959. 
Later works [6, 8, 7, 9] presented additional experimental results to characterize 
performance. These efforts suggested that a range error on the order of 50 km 
might be representative of this technique. Disappointing accuracy results from the 
aforementioned experiments, as well as others done in conjunction with the more 
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complex techniques covered in the next sections, have pushed this approach into the 
background. 

Nevertheless, to be complete the relevant computations are given here. The geome¬ 
try of figure 10.5 can be solved to estimate the transmitter distance D from the receiver, 
assuming a spherical Earth with radius R e> 

( R e cos i} \ 

D — 2 R e arccos (--- & ) . (10.8) 

\Re + hi on ) 

10.3.2 Vertical Aperture Ranging 

It should be obvious that the geometry involved in elevation estimation, when using a 
horizontal aperture for the low elevation angle signals associated with long-range HF 
communications, presents severe challenges. In an attempt to overcome the limitations 
of a horizontal aperture, Gething [5] conducted some interesting experiments using a 
vertical aperture equipped, first, with an array of three and later, six tower-mounted 
horizontal loop antennas. Because the tower was constructed of wood and because 
the horizontal loops were carefully designed and implemented, the experimenters felt 
confident that a mathematical representation of the vertical antenna patterns could 
be associated with the elements. Controlled emitters, broadcasting both continuous 
wave (CW) and pulsed signals, were used to estimate elevation angle; Gething hoped to 
characterize the “best practices” elevation estimation potential for HF communication 
signals. 

Gething utilized a method described by Wilkins and Minnis [3] to make elevation 
angle estimates. This method models the response of a horizontal loop, in free-space, 
as omnidirectional in the horizontal plane and proportional to cos e in the vertical 
plane. The response of such an element when mounted at a height h on the tower 
must also include the effect of the element image in the ground plane. Gethings tower 
was located at the top of a slightly conical hill, with a slope of 1°, instead of on an 
absolutely flat horizontal ground plane; elevation corrections were made in the model 
to account for the small conical characteristic of the ground plane. 

With this model for the element pattern, it is straightforward to show that the 
response of an element to a signal arriving with elevation 6 is 



S — A cos € sin 


(10.9) 
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With this model for the response of a single element, one can consider the ratio of the 
responses at two elements 

51 sin re n 

5 2 sin e n ’ 

where r — hj/hx, and € n = 2irh\ sin e/k. Gething measured the ratio of the element 
signal amplitudes using appropriate instrumentation and used this ratio to estimate 
the elevation angle 6. 

When Gething employed a six-element array, he utilized an alternative method that 
foretells the MUSIC approach. In particular, 32 of the 360 possible combinations of 
two elements from the array were cycled through with a digital switch and the magni¬ 
tude of their ratio displayed on an oscilloscope. The 32 element pairs were selected by 
computing, for a value of/sin 6, the ratio of element responses as a function of this 
parameter, for 400 steps from 0 to 20, in steps of 0.05. For a given value, a pair were 
chosen if they produced a null at that parameter value but were different otherwise. At 
some values of f sin 6, no null could be found; when available, two pairs were chosen 
for redundancy. 

A critical issue in conducting these experiments concerned multimode propagation. 
When two or more modes are present, the pattern is no longer a reliable estimate 
of elevation angle but becomes an interference pattern between signals reaching the 
receiver by way of two or more separate paths. Generally, however, such modes have 
differential delays in the range 0.1-1.0 ms so that with On-Off keyed signals, reliable 
estimates could be obtained at the start and end of a pulse. Such an approach was used 
in the Gething experiments when necessary. 

From these experiments, Gething concluded that elevation errors could be as accu¬ 
rate as ±1°. He also foresaw the potential of future work on wave front analysis 
methods. 


( 10 . 10 ) 


10.4 Mode Differential Geolocation 


When the ionospheric reflecting layer height is not available, either because appro¬ 
priate soundings are not available or those that are available appear to be unreliable 
due perhaps to^multimode propagation phenomena, it is possible to estimate the 
transmitter-to-receiver range by combining elevation angle-of-arrival estimates with an 
estimate of the relative delay between the same signal received via two separate prop¬ 
agation modes. Such a technique is based on the Breit and Tuves theorem, which was 
introduced in chapter 2. This mode differential method, based on experimental work 
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that used a superresolution approach to estimating the complex cepstrum and hence 
the intermode delay, was described by Johnson, Black, and Sonsteby [10]. This section 
outlines this approach to estimating the transmitter location using a superresolution 
approach to estimating intermode delay. 

Recall from chapter 2 that the modulation on a wave propagating in the ionosphere 
travels at the group velocity, which is slower than the phase velocity such that the 
product of group and phase velocities is equal to the speed of light squared. The Breit 
and Tuves theorem shows that the group delay along a ray from a transmitter to a 
receiver on the Earths surface can be determined from the distance between them, 
D , and the elevation angle of the ray at the receiver, 6, and is equal to Z)/(csin(e). 
Conversely, if the delay and elevation angle are measured, the range can be estimated. 
This relationship is key to the method described here. Note that the height of the 
refracting ionospheric layer does not enter into this calculation. 

The derivation of Breit and Tuves theorem disregards Earth curvature and magnetic 
field effects. The theorem states that a wave packet s path to a point in the ionosphere 
is equivalent to a straight path directly above that point. This fictitious path is termed 
the triangulated path. Detailed measurements on actual ionospheric paths have shown 
that the error introduced by ignoring Earth curvature and magnetic fields, although 
introducing some error, does not greatly affect overall geolocation accuracy [11, 12]. 
Figure 10.6 illustrates Breit and Tuves theorem showing the ionospherically refracted 
path from transmitter (T) to receiver (R) for an ionospheric layer at a height h. P' is 
the path from which group delay is calculated. 



Figure 10.6 Geometry of Breit and Tuve’s law 
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Figure 10.7 Diagram of rays involved in differential mode delay geolocation approach 

10.4.1 Differential Mode Delay Estimation 

In this section, Breit and Tuve s theorem is exploited by processing a signal reaching 
two separate antennas via ionospheric refraction. As indicated earlier, this approach 
was investigated by Johnson et al. It will be assumed that ionospheric conditions are 
such that each receive antenna observes the signal reaching it by way of two separate 
propagation modes. In total, there are four rays, as indicated in figure 10.7. It will 
also be assumed that the two receivers have access to each other s data. Then, it will be 
possible to estimate the delay between the modes at each sensor, as well as the various 
intersensor delays. 

At the first sensor, the received signal is 

z\ ( t) — A\s(t — ti) + A 2 s{t — x 2 ) (10.11) 

= Ais(t') +A 2 s(t f - r 3 ). (10.12) 

Here, Aj, i = 1,2 are the attenuations from transmitter to receiver, and X\ is the delay of 
the signal received via Mode 1, while x 2 is the delay of the signal received via Mode 2. 
x’ shifts the time origin so that the Mode 1 signal reaches sensor 1 at time t r = 0, and 
thus x$ = x 2 — X\. 
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At the second sensor, the received signal is 


z 2 (t) —A\s{t — x\ — 8{) +A 2 s(t — x 2 — 8 2 ) (10.13) 

=A\s(/ — 8 \) + A 2 s(/ — T 3 — S 2 ), (10.14) 

where 8 \ is the additional delay to Sensor 2 relative to Sensor 1 for Mode 1 , and 82 is 
the additional delay for Mode 2 . 

If the Fourier transform is taken of these signals, the results can be used to compute 
a normalized cross power spectrum between the two sensors. 

T{z 1 (t)}=F 1 (co) = (A! + A 2 e~j° )X?> )F{co) , (10.15) 


and 

T\z 2 {t)) = F 2 (co) = +^ 2 ^ 7<u(T3+32) )^(^) ; (10.16) 


where F{co) is the Fourier transform of the signal, Using these spectra to 

compute a normalized cross power spectrum, the result is 


Sl2 (co) = 


Fi(o)F 2 (cq) 
\Fi(co)\ • \F 2 (cq)\' 


(10.17) 


Johnson et al. suggests the normalization in the denominator of equation (10.17) to 
reduce the effects of modulation. 

If each of the Fourier transformed signals is written in the form Fj(co) = AiFico ), 
the normalized cross power spectrum, equation (10.17), can be written in polar form, 
with unit magnitude. 

Sn(co) = arg AjA * 2 = (10.18) 

The complex argument of S\ 2 (co) has a periodicity that depends on the intersensor 
delays and the interpath delay, with the amplitude of the periodicity a function of the 
relative amplitudes of the two propagation paths: 


(f)(0)) 



A | sin cox\ +A 2 sin cox 2 "F 2A\A 2 cos •( 2^3 -F 8 2 — <5^) sin ^{8 2 ~F ^i) 

Aj cos cox\ +A 2 cos cox 2 + 2A\A 2 cos y (2r3 + S 2 — 5 i) cos ^ (^2 + 5 i) 

(10.19) 


Johnson et al. [10] utilized the MUSIC superresolution algorithm to carry out a 
decomposition of this cross power spectrum because the signals of interest are rela¬ 
tively narrowband and hence are typically only observed for a few periods of the signal. 



t 
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Because there is also noise contaminating the cross power spectrum, it is appropriate 
to express Sn(co) as 

S n (co) = c^ (co) + N(oj-), ' (10.20) 

where N(co) is a complicated expression involving noise-cross-noise and signal-cross¬ 
noise terms. Johnson et al. assumed that this corrupting noise could be modeled as 
additive, white Gaussian noise; it is also assumed to be uncorrelated with the signals. 

From the cross power spectrum Suico), select M frequency bins containing energy 
from the signal of interest. Use these to form a vector. 

s 12 (a)) = : + ! . (10.21) 

JHum) J [_N{com)_ 

Then form the outer product of this vector with itself to obtain C(<y) = 
E[s 12 (co) s^(n/)]. Proceed to perform a spectral decomposition of C(<z>) using an 
approach similar to that used with MUSIC when analyzing the spatial covariance 
matrix for steering vector estimation. This is done, as in the MUSIC steering vec¬ 
tor estimation algorithm, by means of an eigendecomposition of C(to). If there are 
two coherent paths, C(&>) will be'of rank one and will have a null-space of dimension 
M — 1 . But in general, there will be more than two coherent paths, leading to a some¬ 
what higher rank. Assume for the present that there are only two coherent paths and 
form the null-space matrix C/y constructed using the M — 1 eigenvectors correspond¬ 
ing to the M — 1 smaller eigenvalues of C (00) as its columns. The desired spectrum can 
be found as 

V = (b* QvC$b)- ; , ( 10 . 22 ) 

where the “steering vector” b is defined by 

e j<P((02) 
gj(p ((°m) 

Here the complex arguments <p(cOj) are given by equation (10.20). The goal is to find 
the peaks of the spectrum V that provide an estimate for the differential delay time 
T 3 after a four-dimensional search over the four unknown parameters, A 1 /A 2 , S \, 82 , 
and T 3 . This search is in contrast to the two-dimensional search over the azimuth and 
elevation parameters of a spatial manifold in the steering vector estimation context. 
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When there are more paths, the search will be more extensive; for three paths, the 
search is over seven parameters, two relative amplitudes, three intersensor delays, and 
two differential path delays. 


10.4.2 Range Estimation from Differential Mode Delay 


With this superresolution approach to finding the differential mode delay, it is straight¬ 
forward to apply the Breit and Tuve theorem to estimate the range from a sensor to the 
transmitter. Recall that the elevation angles €\ and 62 were available for the two paths. 

For the first path, the delay P[ is given by P[ = D/ cos €\ = ct\ while for the sec¬ 
ond, the delay P 2 is given by P 2 = Dj cos 62 — ^ 2 - Taking the difference of these two 
expressions, 

D 

t 2 — t\ = T 3 = —(sec 62 — sec 61 ), (10.24) 

c 


leading to 



3 

(sec 62 ~ sec 61 ) ’ 


(10.25) 


the goal of this exercise. 

The paper by Johnson et al. provided a few experimental results showing that the 
method was feasible. They reported on a sea path experiment [10] in which shipborne 
transmissions, with 3 kffz bandwidth, were observed at four locations, 225 km from 
the ship and spread over a 4.2A aperture. FFT processing of these measurements, 
followed by cluster processing of the peaks, produced tight clusters with differential 
intersensor delay spreads of about 0.2 /xs; this corresponds to a range error of 60 m. 
Ffowever, the scatter of the corresponding range estimates showed a systematic bias 
of nearly 18.5 km. At the same time, the azimuth estimates manifested a bias of 8 . 6 °. 
As a result, the geolocation results fell short of expectations, with an average miss dis¬ 
tance of 39.6 km. Nevertheless, the technique is interesting and does provide some 
single-site geolocation capability. 


10.4.3 Mode Differential Geolocation Summary 


Considering the lengthy derivation associated with the differential delay estimation 
process, it is appropriate to summarize the procedure: 

• Obtain the spectral responses from the signals received at two sensors. 

• Compute the normalized cross power spectrum S\2 (co). 
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• Select appropriate frequency bins associated with the signal of interest and form 
a cross power spectral vector. 

• Compute the spectral cross-correlation matrix G using these bins and appropri¬ 
ate time averaging. 

• Perform eigenanalysis of this matrix to determine its null-space. 

• Set up a steering vector search based on the unknown parameters. 

• Find the peak of this multidimensional spectrum and obtain the corresponding 
value of the differential path delay, 13. 

• Using this intermode delay and the elevation angles associated with the two 
modes, estimate the range to the transmitter using equation (10.25). 

• Combine the range and azimuth to estimate the transmitter location. 

10.5 Multisite Differential Time-of-Arrival Geolocation 

The preceding approaches to geolocation all suffer from the angle estimation inac¬ 
curacy associated with the relatively small number of wavelengths characterizing the 
sensor aperture. One proposed alternative that avoids this shortcoming is based on 
differential time-of-arrival (DTOA) estimates on the same signal obtained from two 
pairs of sensors. Each sensor in a pair provides a baseband estimate of the signal wave¬ 
form, including its absolute time of arrival at the sensor, using the techniques covered 
in chapter 12. Even though the actual transmission time for this signal is unknown, 
these two baseband signal estimates can be processed to provide an estimate of the 
relative time of arrival at the two sensors. This concept is illustrated in figure 10.8. 

It is well known that the locus of all possible transmitted signal locations associated 
with such a time of arrival difference corresponds to a hyperbola of revolution. The 
axis of this conic section is specified by the straight line between the two sensors. 
The bisector of this line corresponds to a DTOA equal to zero. 

Now if there are three sensors, two independent DTOA estimates can be produced 
for two of the three pairings. This implies that there are two hyperbolas of revolution, 
and these intersect in a curve in space. The intersection of this curve with the Earths 
surface produces the desired geolocation estimate. 

Figure 10.^illustrates this approach to geolocation. This diagram shows the portion 
of the Earths surface in the vicinity of three sensors, each indicated by a small circle. 
One set of contours shows the DTOA hyperbolas associated with the Sensor 1-Sensor 
2 pair of sensors, in DTOA steps of one millisecond; the other set shows the corre¬ 
sponding DTOA hyperbolas for the Sensor 2-Sensor 3 pair. If a transmitter is located 
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Line of constant 




Figure 10.9 Differential time-of-arrival contours for two site pairs, time step = 1 ms 
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at 20° latitude and 50° longitude, as indicated by the small x, the differential time 
of arrival for its observed signal would be +0.64 ms for the Sensor 1-Sensor 2 sensor 
pair and —1.51 ms for the Sensor 3-Sensor 2 sensor pair. The reader can see that these 
observations are consistent with the DTOA contours. 2 

Time-of-arrival estimate accuracy is driven primarily by the bandwidth of the sig¬ 
nal of interest, which for HF signals is on the order of 5 kHz; this translates to a time 
estimate resolution on the order of 200 /zsec, corresponding to a differential range 
uncertainty, 8r = c8t = 60 km. Subresolution cell accuracy of ten to fifty times smaller 
than this can be obtained if the signal-to-noise ratio is adequate. Potential exists for a 
final DTOA accuracy of 1 km. Compare this to the angular beamwidth of a sensor with 
a 90 m aperture, operating at 10 MHz; that is, a 30 m wavelength, for a beamwidth 
of about 20°; if the beam is split by a 10:1 factor, at 3000 km range the angular mea¬ 
surement accuracy would be on the order of 100 km. At lower frequencies, the angular 
accuracy becomes even poorer; at the bottom of the HF band (2 MHz), the situation 
is ten times worse and at longer range, even more so. Thus, it is appropriate to consider 
methods, such as differential time delay, that do not rely on antenna beamwidth to 
produce accurate geolocation estimates. 
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Copy: Steering Vector Methods 


11.1 Overview 

In addition to the direction-finding products associated with an HF sensor, there is 
strong interest in receiving the signal transmitted over the airwaves and demodulating 
the intelligence it bears. This topic is covered under the rubric signal estimation ; the 
term copy is often used in reference to this process. The challenges with achieving these 
objectives at HF include the weak and variable signal strength, the varying character 
of the ionosphere, and the myriad of interfering signals present in the environment. 
As outlined early in this text, array antennas have the potential for creating an antenna 
pattern that provides gain on the signal of interest while placing nulls in those direc¬ 
tions associated with unwanted, interfering signals. The design of such patterns is 
the primary topic of this chapter. Many algorithms on which to base this design are 
available; the most common and useful at HF are included here in separate sections. 
The focus is on linear-processing approaches, which determine a weight vector that 
is applied to the array elements in a vector-summing process; the result is a single¬ 
output signal intending to represent the desired signal estimate. Because this structure 
produces a response that varies spatially, this result is in effect a beam, and the entire 
process is identified as beamforming. 

Recalling the overall system architecture described in chapter 5, signal copy is a process 
employed after a signal is detected, as detailed in chapter 7, and its direction estimated 
using one of the methods described in chapter 9. Often, short segments of a newly 
detected signal might be copied to assist in resource allocation. The end user might want 
to know something about such a detected signal before collecting enough data upon it 
for precision direction estimation and extended copy. Nevertheless, the methods used 
for short-term copy would be based upon those used for extended collection. 
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In chapter 8, the most common approaches to parameter estimation were reviewed 
as a preview to direction-finding approaches. Since signal estimation, or copy, is essen¬ 
tially a process of estimating a signal vector, the foundations for the methods to be 
discussed in this chapter are similar, if not identical, to those discussed there. Thus, it 
is appropriate to jump right into the beamforming topic; the first approach is based 
on a mean square error methodology. Figure 5.1 shows where the signal estimation, or 
copy, process falls in the overall system block diagram. 

11.2 MSE Beamformer 


Assume that a signal of interest s(t) impinges on a sensor array to produce a received 
signal vector z that includes this signal, modified by the array steering vector associated 
with the signals angle of arrival, as well as corrupting noise. 

z = as(t)v + n(t). (11.1) 


As discussed in chapter 6, a beamformer computes an output signal by applying a 
complex weight vector to the array element signals and summing the weighted com¬ 
ponents, an inner product of a weight vector w MSE and the received signal vector z. 
That is, the estimate of each signal sample s(t) is to be based on a weighted sum of 
the array element signals, expressed in vector form as s(t) = w^-^z (t). A mean square 
error (MSE) beamformer determines a weight vector that minimizes the squared error 
of the signal estimate compared to the true signal. With this estimate for the signal, 
the error is the difference between the true signal and s, 

e{t) = s(t) - T*M SE z(t). (11.2) 

Squaring, 

E[e 2 (,t )] = E [K?) I 2 - 2w^ S£ z (t)s(t) + w^ SE z(t)z H (t)w M SE] . (U-3) 

leading to, after taking the expectation, 

E[€ 2 (t)] —S- 2w^ SE r rs + Wm SE Rvtmse> (11-4) 

where r„ is a column vector of the correlations between the known signal reference 
component and the received element signals, 



■gVMf) 

Z 2 (t)s(t) 


(11.5) 
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and R is the usual covariance matrix. Here, <S is the power in the signal component 
that correlates with the unknown signal. It is independent of w and thus can be ignored 
in what follows. 

As suggested earlier, the goal is to minimize this mean square error. It is thus appro¬ 
priate to compute the gradient of the expression in equation (11.4) and to set the result 
equal to zero. 

Y w (<? 2 0)) = — 2r„ + 2Rw M se = 0 (11.6) 

so that 

VTMSE = Rlr »v (11-7) 

With this set of phase and amplitude weights for the received array element signals, 
the estimate of the transmitted signal becomes 

s(t) = w^sEzit). (11.8) 

Note if the reference is the only signal component that correlates with the received 
array element signals; that is, 

r rs = Sy. (11-9) 

v is the steering vector associated with the signal reference. As a result, 

^MSE — SR^v. (11.10) 

This shows that the MSE copy weights are proportional to the signal-of-interest 
steering vector, whitened by the estimated noise covariance. 

11.3 Steering Vector-Based Copy 

As described in chapter 9, the array elements each receive the composite signal, which 
is used to determine its direction of arrival and polarization characteristic. These char- 
acteristics are summarized in the steering vector estimate matrix V. Using matrix 
notation, a least mean square estimate for the signal of interest (SOI) signal can be 
found starting from the solution of the equation 

N s 

s(/)=argminV ||x(/) — Vs(/) || 2 . (11.11) 

k—\ 

In this expression, V is the set of steering vectors resulting from the direction-finding 
process described in detail in chapter 10. Using this steering matrix for the set of signals 
impinging on the array, one can determine a least mean square fit to the data using 
the standard normal solution 

s(0 = (Uv) _1 Ux(/). 


(11.12) 
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Recall that V has dimensions N x N s and that s(/) is a vector of length N s . One can 

define a signal matrix S with the {s(/)} making up its rows. It is instructive to develop 

/\ 

an expression that estimates the SOI alone, rather than the entire matrix S. Such an 
expression reduces the computational load, and its form provides insight into how the 
signal estimation process operates. 

Following the approach taken by Weiss and Friedlander [7], the steering matrix V is 
partitioned into an N x (N s — 1) matrix, B, and a vector, v, of length N, the steering 
vector associated with the SOI. 

V=[B v]. (11.13) 

In this formulation, v is assumed to be the last column of V. Using B, one can define 
a projection matrix onto its column space and another projection matrix onto its 
null-space P]y. 

= (11.14) 

Py\r = I — P#. (11.15) 


With these projection matrices, one can compute the coefficient of x(/) in equa¬ 
tion (11.12) as a partitioned matrix 


(yHy\ -lyH = 1 (B^B) J B h [ (v^Pn v) I - w^Pn] 

V ' v^P N v v^Pn 


(11.16) 


leading to the following expression for the last row of s(/), the time series associated 
with the SOI. 



v^Pn x(/) 

v^Pnv 


(11.17) 


Recall that the columns of the matrix B represent steering vectors to each 
of the interfering signals that are strong enough to have been isolated in the 
direction-finding/polarization estimation process. Its null-space is the portion of the 
TV-dimensional received signal space that contains no interferer contributions. Thus, 
the matrix product Pnx(/) has no contributions from signals in the interferer direc¬ 
tions; this product is a vector of contributions from the individual sensors. Premulti¬ 
plication by the steering vector v H adds up these sensor contributions with appropriate 
phase and amplitude weights as dictated by the SOI steering vector. 
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11.4 MUSIC-Based Copy 

In chapter 8 several direction-finding algorithms were.described in detail. One of the 
most effective, and hence popular, of these is the MUSIC algorithm. MUSIC pro¬ 
vides indications of those directions from which plane waves are impinging on an 
array. The peaks of the so-called MUSIC spectrum suggest that the signal environment 
includes energy arriving at the array from specific directions. The next goal might be 
to estimate the signal waveform associated with each of the directions corresponding 
to those MUSIC peaks. This section considers how the eigenstructure associated with 
the MUSIC DF algorithm can be exploited for such beamforming purposes. 

11.4.1 Conceptual MUSIC-Based Beamformer 

In most array processing architectures, signal estimates are computed as the weighted 
sum of the array element signals. In section 11.2, based on a mean-square error 
criterion, the weights were shown to be a noise-whitened vector characterizing the 
interference environment and the signal direction. However, the identification of an 
appropriate signal-direction vector was not discussed. One possibility is the steering 
vector associated with a peak in the MUSIC spectrum. Since that peak is specified by 
an azimuth and elevation, there is an associated steering vector in the array calibration 
table. Utilizing this steering vector generates a weight vector to function as a beam- 
former, computing a weighted sum of the array elements in a way that provides array 
gain in the chosen direction but also, as a result of the whitening, minimizes array gain 
(possibly creating nulls) in directions associated with those signals in the environment 
that were not chosen for favorable treatment. 

Such an approach has two shortcomings. First, array weights based on the calibra¬ 
tion table alone do not recognize cochannel interference that should be minimized if 
the benefits of the array architecture are to be realized. This shortcoming is addressed 
by the whitening process. Second, any errors in the MUSIC direction-finding process 
will be reflected in the copy results, compromising its quality. Thus, a different tack 
can be taken to utilize the signal and noise subspace concepts inherent in the MUSIC 
algorithm but recognizing the possibility of array errors. 

Recall how MUSIC computes its spectrum. It starts with a covariance matrix com¬ 
puted from the array output vector. Eigenanalysis of this covariance matrix, followed 
by signal number estimation, determines the eigenvectors associated with the largest 
eigenvalues and those associated with the remaining “noise” eigenvalues. The “modes” 
of the covariance matrix associated with the larger eigenvalues provide clues to those 
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environment components that contain the most energy. Assume that the MUSIC spec¬ 
trum was computed using the eigenvectors associated with the noise eigenvalues that 
specify the “noise 55 subspace; then, the lines of bearing specifying directions associated 
with emitters can be estimated by finding steering vectors orthogonal to this noise 
subspace. Such steering vectors specify the directions of signals, and those signals can 
be estimated by applying their steering vectors to the array data while at the same time 
nulling all other signals by invoking the inverse covariance matrix as a whitener. As 
a consequence, the array attempts to reject signal energy coming from the interferers 
while providing array gain in the signal direction. This can be done with each steering 
vector associated with a MUSIC spectral peak. 

This concept can be described mathematically as follows. A maximum likelihood 
approach to MUSIC-based signal estimation can be based on maximization of the 
log-likelihood function, which is proportional to 

q(Z-VS)(Z-VS) // ] = (Z-VS) i/ (Z-VS). (11.18) 

Maximizing this expression involves taking the divergence and setting it equal to zero. 

V s [Z /7 Z - S H \ H Z - Z H VS + S^V^VS] = 0, (11.19) 

which implies 

(V H V)S = V H Z, (11.20) 

leading to 

S=(V H V)~ ] V H Z. (11.21) 

If a set of weight vectors for the N s signals is specified by W=VfV^V) -7 , where 
the N s vectors V(k) making up the columns of V are chosen to be the array vectors 
corresponding to the peaks of the MUSIC spectrum, then MUSIC-based estimates 
for the unknown signals become 

S=W H Z. (11.22) 

Unfortunately, this approach to a MUSIC-based beamformer leads to poor perfor¬ 
mance because the prestored array steering vector often does not precisely correspond 
to the actual array excitation in the true signal direction. This issue is particularly appar¬ 
ent when the signal-to-noise ratio is high. The next section addresses this issue and 
describes a more robust beamformer based on the eigenstructure approach inherent in 
the MUSIC DF algorithm. 
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11.4.2 Beamformer Implementation Issues 

In most practical array-processing situations, there are several sources of error 
that compromise perfection in the prestored steering vector table. These include 
1) manufacturing differences in the array elements; 2) imprecise placement of antenna 
elements; 3) inaccurate array cable lengths; 4) mutual coupling between array 
elements; and 5) deviations of the signal wave front from planarity, especially likely 
at HR The net result is that an arriving plane wave direction vector v deviates 
from the prestored steering vector c corresponding to a particular line of bearing 

(LOB) [3]. 

Assume that the goal is to form a beam pointing in a direction that produces 
the highest signal-to-noise-plus-interference ratio in an environment modeled by the 
covariance matrix 

R*x - /W 7 + R nn , (11.23) 

where R nn represents the interference plus noise covariance matrix, and P specifies the 
signal power. In such an environment, the optimum beamformer is specified by 




NR-> 

v"R-V 


(11.24) 


But the true direction vector is unknown. If it is modeled by the steering vector 
associated with the MUSIC-derived LOB c, this becomes 


VS * ntit — 


NR-’c 

c H RZ.lc 


(11.25) 


If this is implemented with the measured covariance matrix, the implementation 
utilizes 

R*/ = R,J 1 - + Pv //rw v ) ’ (1L26) 

and the output power due to the source is 


P = P|wJv| 2 


/W 2 |p| 2 

[l + y(i — IpI 2 )] 2 ' 


(11.27) 


276 Chapter 11 


where p = v^R“Jc is the inner product of the assumed steering vector c for the LOB 
determined from the MUSIC DF process and the actual array response vector v (in 
the metric associated with the interference-plus-noise covariance), and y = Pv^R~Jv 
is the array signal-to-interference-plus-noise ratio. 1 

Equation (11.27) is helpful in understanding the issues associated with inaccurate 
steering vectors. When there is even a small discrepancy between the assumed steering 
vector for a given direction and the actual array response vector produced by the SOI, 
the inner product described by the parameter p will deviate from unity. This factor 
will affect both numerator and denominator in this expression. When the signal-to- 
interference-plus-noise, quantified by the parameter y , is small, the numerator will 
directly reduce the desired output power. But when y is large, the ensuing large 
denominator will drive the desired output power even lower. Thus, it is appropriate to 
develop alternative processing methods to mitigate such effects. 

This phenomenon motivates substantial interest in robust beamformers to avoid 
signal cancellation issues. Li and Stoica have compiled a collection of papers on this 
topic [4]. The interested reader is encouraged to consult this volume. 

11.4.3 Signal-Subspace Projection Beamformer 

Having shown that a discrepancy between an assumed steering vector and the true 
array response vector results in a degraded signal-to-noise ratio, it is appropriate to 
consider a mitigation approach. Section 9.4 described the MUSIC algorithm and 
its reliance on the concept of a signal subspace. Eigenanalysis of the sample covari¬ 
ance matrix allowed separation of the array data space into two orthogonal subspaces, 
the signal-plus-interference subspace and the noise subspace. Because the collected data 
provide the foundation for the signal-plus-interference subspace, one can expect the 
true array response vector to lie in this subspace. At the very least, this subspace 
is more likely to contain this vector than it will contain the predetermined steering 
vector associated with the LOB estimate produced by the MUSIC DF process. Such 
reasoning leads to the projection method for beamforming [5]. 

In this approach, the eigenvectors associated with the largest eigenvalues of the sam¬ 
ple covariance matrix are used as basis vectors for the signal-plus-interference subspace. 
If these eigenvectors are used as the columns of a matrix E ss , the steering vector associ¬ 
ated with the MUSIC-derived LOB can be projected into the signal subspace. Then, 


1 This expression assumes that the steering vectors have been normalized using c H R nr \ c = N. 
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the resulting vector y s = V ss c can be used to obtain the beamformer weight vector in 
equation (11.24). The resulting beamformer is thus 


w 


R -i P c 


mu 


(P 


ss 


c) h R-!(P ss c) 


(11.28) 


where P B = E„(EfE B )- 1 E" is a projection matrix onto the column space defined by 
the columns of E ss . Using these weights, one can obtain a MUSIC-based estimate for 
the SOI from 

s = wlZ. (11.29) 

This approach, projecting the MUSIC-based steering vector onto the signal subspace, 
followed by whitening, has been found to provide much more robust performance 
than the conceptual MUSIC-based copy described in section 11.4.1. 


11.5 Polarization-Based Methods 


Section 9.5 described a direction-finding technique that is useful when an antenna 
array utilizes diversely polarized elements. Such elements complicate the DF process 
because the array response becomes sensitive to the polarization state of the arriving 
signal. Such sensitivity must be quantified using a pair of array manifolds corre¬ 
sponding to two orthogonal polarization states for the arriving signal. The techniques 
described in chapter 9 provide a solution for the SOI arrival angle and polarization and 
as a result provide an appropriate estimated steering vector for the SOI. This steering 
vector can be used to provide an estimate of the signal arriving from this direction with 
the specified polarization state. 

11.5.1 Signal Copy Exploiting Polarization Differences 

Section 9.5 contains a detailed model for arrays of polarization-sensitive elements. 
Here, the responses of such an array are exploited to provide a polarization-diverse 
beamformer that can separate signals with dissimilar polarization characteristics even 
though these signals arrive from very similar directions. A polarization-based steering 
vector, estimated in the direction-finding process as detailed in chapter 9, provides the 
basis for this processing. 
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11.5.2 Polarization-Based Copy Example 

In this example, a simple two-element dual-polarized array is defined as a four-element 
array distributed along the x-axis: Elements 1 and 2 are collocated, with Element 1 
polarized in the x-direction and Element 2 polarized in the y-direction. A similar sit¬ 
uation applies to Elements 3 and 4. Thus, the odd elements are x-polarized, and the 
even elements are y-polarized. The first two-element pair is located at the origin, while 
the other pair is located 15 m in the positive x-direction. 

Two signals impinge on this array. “Signal A” is a 12 MHz CW signal arriving from 
an azimuth of 20°, while “Signal B” is a BPSK signal arriving from an azimuth of 30° 
at the same carrier frequency. Both signals are sampled at 80 Msps, and the data carried 
by Signal B use a baud rate of 0.8 MHz. 2 The baud rate is not a critical factor in imple¬ 
mentation of the polarization-based copy-processing technique. At the 12 MHz carrier 
frequency, the signal wavelength is 25 m. Thus, the element spacing is 15/25 — 0.6A, 
corresponding to a beamwidth of 1.67 rad, or 95°. Clearly, both signals arrive at the 
array well within a beamwidth of each other. However, the two signals have different 
polarization characteristics: Signal A is assumed to have the polarization characteristic 
of the even elements, while Signal B has the polarization characteristic of the odd ele¬ 
ments. As a result, Elements 1 and 3 do not respond to Signal A, and Elements 2 and 
4 do not respond to Signal B. 

To illustrate the benefits of using polarization, figure 11.1 shows the total signal for 
a nonpolarized antenna sited at the location of Elements 1 and 2 but equally sensitive 
to both polarizations. Receiver noise at a level that is 20 dB below each signals level is 
included in this illustration. It is clear that the two signals interfere with each other, 
producing a resultant that is not a constant envelope signal, as expected for either of 
the two incoming signals. 

Next, the array-processing approach described in section 11.3 is applied to the arriv¬ 
ing vector signal. It is assumed that the geometric steering vectors are perturbed from 
the ideal by random errors of about 20 percent. Because Elements 2 and 4 are the 
only ones seeing Signal A, the polarization-sensitive steering vector has zeros for these 
elements when copying this signal; a similar argument applies for the appropriate 
elements of Signal B’s polarization-sensitive steering vector. 

When these geometric steering vectors are modified by the polarization character¬ 
istics and the resulting vectors are used appropriately for the 4x1 matrix B and the 


2 These rates are used in this example to permit representation of both the array size in wavelengths and the data 
sample transition without adding the complications of a baseband, downsampling receiver to the simulation. 
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Sample number 

Figure 11.1 Composite example signal for a single nonpolarized antenna element 

4-vector v in the copy processor described in section 11.3, the results of copying Signal 
A and Signal B are shown in figure 11.2. It is clear from this figure that the copy result 
for Signal A has no evidence of any data, while the copy result for Signal B shows a 
clear data transition at the appropriate time. 

However, the effects of the steering vector errors are apparent in that the ampli¬ 
tude estimates of the two signals deviate significantly from their unit magnitudes, and 
there are other distortions in the detailed waveforms. As described in section 11.4.3, 
the estimated steering vectors associated with the two signals can be projected onto the 
signal subspace as estimated from the two largest eigenvectors of the sample covariance 
matrix. When this projection approach is used to obtain the final estimates of the two 
signal waveforms, the results are shown in figure 11.3. These estimates of the wave¬ 
forms manifest significantly less amplitude error and distortion than their equivalents 
in figure 11.2. 

11.6 Copy Weight Tracking 

The previous sections, and all of chapter 12, describe how a set of copy weights can 
be chosen to coherently combine signals from the various antenna array channels 
such that the resulting beam captures a selected signal while minimizing undesired 
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Figure 11.3 

approach 


Estimated Signal A: 100 samples at first bit transition 



Estimated Signal B: 100 samples at first bit transition 



Figure 11 .2 Estimated Signals A and B at a bit transition time 
Estimated Signal A at first bit transition, projection method 



Estimated Signal B at first bit transition, projection method 



Estimated Signals A and B at a bit transition time, using subspace projection 
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interference from other signals inhabiting the same band segment. As has been noted, 
different copy weight computation techniques provide the user with alternatives, 
each using specific criteria for determining its channel weighting. These weights are 
determined first and then applied to a block of data collected from each channel. 

However, the environment is not static, and the SOI will typically persist for a 
much longer interval than one block length. This section addresses the computation 
of a series of weight vectors, each applied in the same way as the first but computed 
in a way that considers the weight vector used for the previous data block(s). After the 
first weight vector, subsequent vectors should not only suppress interference well; they 
should also do so in a way that does not introduce temporal discontinuities into the 
resulting copy stream. It may become advantageous to base the initial weight vector 
on one (longer) block length and to iteratively modify this initial vector in response to 
a shorter data block. Techniques for such weight adaptation have not been extensively 
researched. 

A spatial covariance matrix has been seen as the central component in all copy 
weight computation approaches. Such a matrix is based on a set of successive received 
signal snapshots taken by the array over an appropriate time interval. A covariance 
matrix is computed, and a copy weight vector is derived from it. Because the environ¬ 
ment can be expected to change, another set of snapshots is taken after a “coasting” 
interval, and from it a new covariance matrix and a corresponding new copy weight 
vector are computed. This approach introduces a risk: the two weight vectors might 
be sufficiently different that a discontinuity appears in the copied signal. Such a 
discontinuity can be the source of data errors for the end user. 

One approach to avoiding such discontinuities is to utilize a set of interpolated copy 
weight vectors for those copied signal samples between successive covariance matrix 
computations. Consider the timing diagram shown in figure 11.4. 

In this diagram, the initial covariance matrix is computed from snapshots taken over 
a time T c and the next covariance matrix from snapshots taken after a delayed interval 
7Two copy weight vectors are computed from the two covariance matrices, but 
these are not used directly to provide the desired copy output from the array. Instead, 
an interpolation process is employed to generate a series of copy weight vectors, one 
for eadxcopy output sample. 

To be specific, assume that the array snapshot interval and the desired copy sample 
interval are identical, equal to S T. If the covariance matrices are computed to cover a 
greater bandwidth than that of the specific signal to be copied, this may not be true, 
but the concept to be illustrated here is still valid. In figure 11.4, the signal sampling 
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Figure 11.4 Copy weight tracking timing diagram 


times are shown as “tick” marks on the time axis at the bottom. There are a total of 
Nj = Tg[/8T samples between the starts of the two covariance computations. 

Designating the first and second of these two copy weight vectors by w j and 
w 2 , respectively one can use linear interpolation to compute individual copy weight 
vectors for each output snapshot in accordance with 


w {n) — wj 



( 11 . 30 ) 


for n — 0 : (TV* — 1). 

Such a first-order interpolation scheme will have discontinuities in slope as the 
underlying covariance matrices are switched. This may be acceptable, or not. More 
elaborate interpolation schemes, which involve additional covariance matrices but avoid 
such higher-order discontinuities, can be utilized; however, such approaches add latency 
to the copy process, and such a delay in the copy output may not be acceptable to users. 


11.7 Wideband Processing 


As has been noted many times, the bandwidth of most HF communication sig¬ 
nals is limited to a few kilohertz. This characteristic of HF communication was 
originally driven by available technology, but the difficulty of transmitting an undis¬ 
torted, wider bandwidth signal over a channel that changed rapidly, both temporally 
and spectrally, discouraged the growth of HF usage for such high-data-rate appli¬ 
cations. Further, the motivation for wider signaling bandwidths was largely absent 
because alternative methods, such as worldwide wire and space-based communication 
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networks, developed. These could be used economically and reliably for wider 
bandwidth information. 

Recently, [8] [9] there has been interest in wideband HF communications, primar¬ 
ily driven by low-probability-of-intercept (LPI) communication applications. Such 
signaling has been demonstrated with signals having about 1MHz bandwidth. This 
topic will not be considered further in this work. 

Of immediate interest, however, is the development of wideband surveillance 
receivers that capture a substantial segment of the HF band continuously and uti¬ 
lize advanced signal-processing technology to dissect the captured environment. SOIs 
are sorted from the interferers, and those signals that are worthy of attention are desig¬ 
nated for prosecution. Direction-finding and signal estimation processes are invoked 
and the products delivered to potential users. 

A system designed to operate in this way faces several questions that will be discussed 
here: 

• How much bandwidth should be covered at once? 

• How should this total instantaneous bandwidth be segmented for interference 
suppression? 

, • If this segmentation is chosen to be less than that of a single HF signal, how 
should the multiple signal segments covered by that signal be combined? 

• How does this segmentation affect the degrees of freedom available for spatial 
interference suppression? 

11.7.1 Total Coverage Bandwidth 

As discussed earlier, although the sampling rate of analog-to-digital converters (ADCs) 
was once the critical element in deciding how much of the HF band should be covered 
by a single sensor, the recent availability of 16-bit converters that sample at more than 
80 Msps has changed the story. Such an ADC is able to sample the entire HF band, and 
feed those samples to high-speed processors able to carry out fast Fourier transforms 
(FFTs) with this data in real time. As a result, the limiting factor in determining the 
total sensor bandwidth is the dynamic range of the amplifiers preceding the ADC. 
The desire for at least 90 dB dynamic range in the system is still an effort for such 
preconditioning amplifiers. 

Second harmonic distortion is the primary limitation. The potential for such dis¬ 
tortion suggests that any band segment should not cover more than about 1.9 times the 
lowest frequency; if this limit is obeyed, the second harmonic of the lowest frequency 
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falls outside the sensor coverage and can be filtered out before the wideband signal is 
presented to the ADC. 

A strategy for covering the entire HF band is derived from such considerations. 
Following the 1.9 rule suggests that the topmost band start at about 16 MHz. The 
next lower band should be 9-16 MHz; then 5-9 MHz; 2.8-5 MHz; and finally, 
2-2.8 MHz. 

The difficulty with this approach is that the data capture rate varies substantially 
from subband to subband. If the data-processing and storage subsystems are designed 
for the uppermost band segment, they are overdesigned for the lower segments. 
Such considerations lead to compromises such as: 2-6 MHz, 6-10 MHz, 10-14 MHz, 
14-18 MHz, 18-24 MHz, and 24-30 MHz. Such a design risks second harmonic 
distortion in the lowest band segment, but this subband is relatively unimportant 
operationally. The risk might be worthwhile considering that the data storage rate req¬ 
uirement is reduced from that associated with 14 MHz by a factor 14/6, even though 
the first option requires only five subbands rather than six. As data storage technology 
evolves, the tradeoff might swing toward the five subband option. 

11.7.2 Bandwidth Segmentation 

Once the ADC samples are available, they must be buffered and prepared for band 
segmentation. This process starts with the full bandwidth of the subband in ques¬ 
tion and divides it into very narrowband segments using FFT processing. In todays 
computing environment, it is appropriate to combine this process of creating very 
narrowband cells with digital down-conversion so that the resulting process leads to a 
group of contiguous baseband cells covering a portion of the sensor bandwidth. There 
are many alternatives for carrying out this process, and it is not the goal of this section 
to discuss them in detail; the choice among them should be based on digital data 
networking factors, which are beyond the scope of this work. 

Instead, the focus here will be on the bandwidth of the individual band segments. 
This design parameter should be based on the interference environment and on the 
SOI bandwidths. Because most SOIs have audio 2.4 kHz bandwidths as dictated by 
early HF receiver designs, this bandwidth is as wide as one is likely to consider. Most 
SOIs have a bandwidth such as this. Interference, however, can come from such a 
variety of signals that a narrower band segment is generally used. This plan is also 
supported by the realization that HF communications are not disciplined and that an 
SOI might be overlapped with multiple interferers, each with a different bandwidth, 
some wider and some narrower than the SOI. An SOI bandwidth of 2.4 kHz might 
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therefore be subdivided by a factor of 4, 8, or even 16, leading to segment bandwidths 
as small as 150 Hz. 

11.7.3 Segment Combining 

Assume for this discussion that the segment bandwidth has been set at 600 Hz, or 
one quarter of an SOI bandwidth. These segments will be created contiguously across 
the sensor subband bandwidth so that a typical SOI will not necessarily fall neatly 
into four of these segments. More likely, it will span five segments. The sensor has 
two approaches for processing this SOI once it has been detected, and its operat¬ 
ing parameters, center frequency and bandwidth, have been estimated. The processor 
could ignore the segments, which, as was discussed in chapter 7, were used for detec¬ 
tion processing and use a narrowband receiver with an appropriate center frequency 
and bandwidth for each of the N sensor antenna channels. Then, the adaptive array 
signal estimation (copy) algorithm could be used with these length N signal vectors 
to produce the desired copy channel at the detected frequency and bandwidth. 

Figure 11.5 shows schematically how this signal-specific, receiver-based approach to 
interference suppression might be carried out. 

Alternatively, a more aggressive approach would use the narrowband segment data 
directly to form separate antenna channel signals in each segment, to form several 
segment-based beams, and to combine their outputs to obtain a signal that spans these 
band segments. Such a processing architecture is known as a transmultiplexor approach. 



beams 


Figure 11.5 Signal-specific, receiver-based approach to copy processing, K signals 
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Figure 11.6 Transmultiplexor approach to copy processing, K signals 

Figure 11.6 shows schematically how this transmultiplexor approach to signal copy 
might be carried out. 

The advantage of this approach is that the full set of array degrees of freedom are 
available for the interference in each segment, and the total number of interferers 
that could be addressed increases accordingly. If the interferers are very narrowband 
and only span one segment, the total number of interferers would be increased by 
a factor equal to the number of band segments spanned by the SOL For example, 
if the sensor includes eight antennas, the segments have a 600 Hz bandwidth, and 
the 2.4 kHz bandwidth SOI spans five of these, there will be 40 degrees of free¬ 
dom available for interference cancellation. This is in contrast to the 8 degrees of 
freedom available with the first approach. If some of the interferers span multiple 
segments, the number of available degrees of freedom would fall between these two 
extremes. 

Despite its advantage, the second technique is still a research topic. The key chal¬ 
lenge arises from the reconstruction processing needed to reassemble the signal from its 
segment components after interference suppression is complete. Segment-to-segment 
phasing and gain control are critical if the signal is to emerge undistorted. This 
advanced approach has potential for countering extremely heavy interference, but the 
reconstruction process could “throw out the baby with the bath water 5 and leave the 
resulting signal so distorted that it cannot be demodulated effectively. 
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The previous chapter introduced the signal copy topic and presented specific tech¬ 
niques that are applicable to a wide variety of signal types. This chapter continues to 
explore the signal copy area but recognizes that narrowing the range of applicabil¬ 
ity to a specific class of signals allows a copy algorithm to exploit its specific features 
and as a result, to achieve better performance, especially in environments with heavy 
interference. Several algorithms in this class will be described next. 

12.1 Overview 

Many communication signals have well-defined features associated with their specific 
format. Such a feature can be exploited in an extraction process that eliminates any 
other signal in the environment that compromises this feature in the output. Such 
processes are categorized as signal restoral techniques. In the context of adaptive array 
processing, such techniques take the form of a weight vector that is applied to the 
received signal vector, the result being a single data stream with minimum contri¬ 
butions from any signal other than the one with the specified feature. In effect, the 
weight vector implements a beam to provide gain in the direction of any energy that 
exhibits a specified feature and reduces gain in any directions associated with other 
energy sources. Such signal restoral techniques contrast with the spatial-coherence app¬ 
roaches, which rely entirely on the plane wave characteristics of the signal arriving from 
a given direction to cohere the signal of interest (SOI). Such spatial techniques were 
detailed in the previous chapter; they rely on array calibration information, associat¬ 
ing complex gain vectors (relative amplitude and phase for each channel) with each 
possible direction. The set of such vectors, together with the associated direction of 
arrival, is the so-called array manifold. 
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This chapter addresses such signal restoral approaches, with an introductory section 
covering those signals manifesting a well-defined carrier component. This class of 
signals includes the commonly encountered amplitude modulation, and it provides 
an initial encounter with a broader class of signals known as cyclostationary signals, 
which are covered as a signal class in the next section. This class is important because 
it includes many digital signals that exhibit rate lines in their spectra. 

The next section addresses spectral difference processing, a technique based on the 
idea that all spectral subbands associated with a (narrowband) signal should be char¬ 
acterized by the same steering vector. Any deviation suggests that there is interference 
arriving from a different direction that is not a component of the SOI, and it should 
be nulled. 

Following this, attention turns to signals with particular amplitude properties. 
On-Off keyed signals will be covered first, followed by a discussion of constant 
modulus signals. 

Another important topic concerns voice-modulated signals, including amplitude 
modulation, which was introduced earlier when discussed as a carrier-based signal. 
However, at HF single-sideband and double-sideband suppressed carrier formats are 
also frequently used. Frequency modulation is also covered. 

As an HF-specific topic, the type of signal associated with the automatic link estab¬ 
lishment (ALE) protocol will be described, and recommendations for appropriate 
signal-extraction algorithms useful when dealing with an ALE signal will be discussed. 

The final signal type to be discussed concerns over-the-horizon radar (OTHR) 
signals. Adaptive array processing for such radar signals is a relatively new topic with 
much ground to be explored in the future [1]. However, the use of the techniques asso¬ 
ciated with communication signals is applicable, and those array-processing options 
useful for radar signals will be covered. 

12.2 Carrier-Based Nulling 
12.2.1 Carrier Signal Formats 

Consider a real RF carrier signal co sco 0 t modulated by the real, low-pass time 
function a{t ), 

s(t) = a(t) cos co 0 t . (12.1) 

The spectrum of the carrier alone, being a real function, consists of impulses at 
-E(0 0 ; the spectrum of the modulation, also specified as real, has a low-pass modulation 
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Figure 12.1 Spectrum of a double-sideband modulated waveform 

spectrum A(co) that exhibits conjugate symmetry about co = 0. The modulation 
process, mathematically equal to the product of these two waveforms, produces a spec¬ 
trum at its output equal to the convolution of their two spectra. Thus, the modulated 
waveform has a spectrum that consists of two shifted replicas of the modulating spectra, 
centered about the frequencies ia> 0 . The replica at +co 0 is illustrated in figure 12.1. 

This approach to generating a modulated RF signal is known as double-sideband 
modulation . It is typically generated with a device known as a balanced modulator , 
which if carefully constructed will not produce a carrier component at its output. As 
a result, the double-sideband modulated signal will not include a carrier component. 

A more common form of broadcast signal has a format in which the envelope of the 
transmitted waveform is modulated with the low-pass signal to be transmitted. This 
transmitted signal, known simply as conventional AM modulation, is represented by 

s{t) — [1 + m a a{t)\ cos oo 0 t . (12.2) 

Here, the constant factor m a is known as the modulation index; it is chosen such that 
the envelope, indicated in brackets in equation (12.2), is less than the amplitude of 
the carrier, in this example equal to unity. Should the envelope exceed this value, the 
result would be overmodulation, resulting in distortion at the receiver. 
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Note that the transmitted signal is now the sum of a double-sideband modulated 
signal and a simple carrier. Thus, its spectrum includes both the two sidebands seen 
with double sideband modulation and a carrier tone. Such signals are useful because 
a suitable receiver is simpler; it does not have to regenerate a carrier locally: the carrier 
is included with the transmission. 

12.2.2 Algorithm Description 

An adaptive array-based HF receiver can exploit this carrier to develop an antenna 
channel weight vector to use as a beamformer. As will be detailed, the approach is 
based on a very narrowband filter that spans the frequency associated with the carrier 
of interest, assumed to be known at the receiver. In practical terms, there will be limits 
to the accuracy of this knowledge, especially at HF, where the ionospheric channel 
introduces frequency shifts in a transmitted signal. Furthermore, the SOI may only be 
transmitted for a short time, limiting the available frequency resolution of a processor 
that is seeking to exclude interference and thus not wanting to include inputs before or 
after this signals temporal extent. Consequently, a lower limit to the filter bandwidth 
for an HF signal might be on the order of 10 to 100 Hz. Since a filtering processor 
designed to achieve a bandwidth of B Hz must coherently process its input for \/B 
seconds, the corresponding filter must process inputs for as long as 100 ms (10 Hz 
bandwidth). For a brief signal, such a processor might only produce a few indepen¬ 
dent output samples, yielding little information about the carriers modulating data. 
Nevertheless, if the only goal of this processor is to estimate the carrier s steering vector, 
such information is not necessary. 

Now to detail the array-processing approach to capturing this signal. Assume for 
the moment that there is only one signal in the environment manifesting a particular 
carrier. As in so many of the processors covered in this work, this signal is modeled as a 
steering vector multiplied by a scalar time series corresponding to the carrier associated 
with the SOI. If z f(t) is defined as a vector of channel signals at the output of the 
narrowband filter described earlier, 

if(t) = vco s(co 0 t + (j> 0 ) +i(t). (12.3) 

The modeled signal Xf(t) is not the set of raw baseband antenna outputs, z(t), but 
is the output of the narrowband filter included in each antenna channel. This filter is 
designed to be very narrow and may not pass any signal components other than the 
carrier associated with the SOI. If there are any such components, either as part of the 
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SOI or possibly some other interfering signal, their effects on x(t) are to be included 
in the interference component i (t). 

As discussed previously, the time series zy(V) has a bandwidth equal to the filter 
bandwidth B so that independent samples are available every 1/5 seconds. The pro¬ 
cessor can capture a sequence of these samples from each antenna channel for a time 
no greater than the SOI s expected duration and place these samples in the columns 
of a matrix Zy, with separate rows corresponding to the different antenna channels. 
The processor should then produce an appropriate linear combination of the channel 
signals to represent the expected carrier. The coefficients in this linear combination can 
be arrayed in a weight vector wjvbf that is applied to the matrix Zy. 

Assume that there are L samples from each of the N antenna channels. Then, the 
matrix Zy has dimensions N x L, and the vector w^y bf has dimensions N x\. The 
linear combination suggested previously is then specified by 


s — ' w NBF^ J f‘ (12.4) 

Obviously, the dimensions of the vector s are 1 x L. 

Next, the computation of the weight vector w nbf that produces the most accurate 
representation of the carrier is addressed. 

Once the weights have been chosen, the deviation of the resulting signal from the 
expected carrier signal is given by 

S — S = S — Wyyg ^Zy, (12.5) 

where e is an error row vector with dimensions 1 x L. To find the weight vector that 
minimizes the mean square error, this expression is squared and averaged. 


e\ 2 = \s\ 2 - 2Re[vft BF Z f s H ) + v% BP Z f Zf wy vbF 


( 12 . 6 ) 


so that 

E\\€\ 2 ] = S — 2Re[w^^r.y] + W/yg f W NBF , (12.7) 

where S = jEfss^] is the energy in the signal s, r,y is a column vector of correlations 
between the known signal carrier (reference) and the received channel signals, and 
% = Z fZj is the channel-to-channel covariance matrix computed from the filtered 

data. Here, , 

L 


* z f s ~ T ^/ S 


H 


( 12 . 8 ) 
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Taking the gradient, 

V w (A[|e| 2 ]) = —2r ZfS + 2Rjw mbf (12.9) 

so that setting the result equal to zero yields 

W NBF — (12.10) 

In the present scenario, the specified carrier is assumed to be the only impinging sig¬ 
nal in the environment that correlates with the channel outputs. As a result, since 
s = s(t)y c , 

= Sv c , ( 12 . 11 ) 

where \ c is the steering vector associated with the signal carrier. Thus, r ^ becomes 
an estimator for v 0 the steering vector of the signal with the embedded carrier. 
That is, 

\ c oc r x fi. (12.12) 

But the complete SOI is assumed to be narrowband; consequently, the same steer¬ 
ing vector applies to all its components, not just the carrier. Hence, a weight vector 
that is useful for the entire signal can be determined using this steering vector but 
instead of characterizing the interference by means of a covariance matrix computed 
from the filtered data, using the channel signals occupying the bandwidth associ¬ 
ated with the entire SOI to compute the covariance matrix and using it in place 
ofRff. 

Thus, the weight vector becomes 

W MBF = (12.13) 

Consequently, the processor employs two sets of N filters, one very narrowband set 
to isolate the carrier and a second set to obtain data to characterize the interference 
over the SOI bandwidth. The outputs of the second set are used as the N rows of a 
matrix Z, which is then used to compute R^ for use in 12.13. 

R zz -ZZ h . (12.14) 

Finally, the SOI is estimated from 

s = Wjy BF Z. 

Figure 12.2 summarizes the processor configuration in block diagram form. 


(12.15) 
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Figure 12.2 Block diagram of carrier-based nulling processor 

12.2.3 Example 

As a simple example to illustrate carrier-based nulling, consider an environment with 
a conventional amplitude-modulated signal carrier at 100 kHz, manifesting two side¬ 
bands as well as a carrier; the modulating bandwidth is about 2 kHz, and the signal 
is arriving from an angle that is 20° off the array broadside. In addition to this sig¬ 
nal, the environment includes a tone centered in the upper sideband, arriving from 
an angle 50° from broadside, and of course, receiver noise. The interference tone dis¬ 
torts the upper sideband of the SOI, likely disrupting its demodulation. Any spectral 
filtering approach to interference suppression would have substantial effects on the 
demodulation process. 

An eight-element linear array with element spacing of 0.45 A receives this environ¬ 
ment, and the goal is to process the multichannel received signal to recover the SOI. 
Figure 12.3 shows the composite spectrum of the SOI and interference at one of the 
antenna elements. 

The carrier-based nulling technique is applied to the received signal, developing a 
set of beamformer weights as described at the beginning of this section. The result 
produces the array pattern shown in figure 12.4. It is clear that the resulting pattern 
protects the SOI direction, with much reduced gain in the direction of the interfering 
tone. Figure 12.5 shows a segment of the signal at the beamformer output, as well 
as the same segment of the transmitted SOI, illustrating the interference elimination 
power of this beamformer. 
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12.3 Carrier-based nulling example: antenna element signal 



Figure 12.4 Carrier-based nulling example: beamformer array pattern 
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Figure 12.5 Carrier-based nulling example: input and CBN copy output segments 

12.3 Rate Line-Based Approaches 

The previous section explained how an adaptive array processor can exploit a carrier 
when a narrowband signal manifests such a feature. This technique can be extended 
for a wide range of signals that include other so-called cyclostationary features [5]. The 
resulting methods are covered in this section, beginning with a generalized description 
of cyclostationary signals and an explanation of their relationship to those signals that 
include a carrier. With this foundation, adaptive array-processing techniques for cyclo¬ 
stationary signals will be described in detail. It will become clear that the carrier-based 
nulling of the previous section is a special case of these methods. 

12.3.1 Cyclostationary Signal Description 

Many communication signals that do not include a specific carrier frequency in 
their spectral characteristic have a hidden periodicity that can be exploited for sig¬ 
nal estimation. Such hidden features can be unveiled by passing the signal through 
a nonlinear transformation, typically with a quadratic characteristic. Such a signal 
can be modeled as having statistical parameters that vary in time in a periodic man¬ 
ner and are thus identified as cyclostationary signals. This attribute is specifically seen 
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in the signal moments, which display second-order periodicities. As a consequence, 
the spectrum manifests correlations between spectral components of the seemingly 
random fluctuations often associated with signal modulation. 

Although the focus herein is on exploiting these cyclostationary features for signal 
estimation purposes, they can also be exploited for detection, direction of arrival 
estimation, time difference of arrival estimation, or signal identification. 

Cyclostationary signals are characterized by the cyclic autocorrelation function, which 
is a generalization of the conventional autocorrelation function. Their formal def¬ 
inition states that a signal exhibits cyclostationarity if and only if it is correlated 
with certain frequency-shifted versions of itself. Once the cyclic autocorrelation func¬ 
tion has been defined, it leads to its Fourier transform, the spectral-correlation density 
function. It will be seen that these two cyclic characteristics bear the same relationship, 
as does the conventional autocorrelation function and its Fourier transform, the power 
spectral density, as stated in the Wiener relation [15]. 

It is well known that a signal x(t) contains a spectral component with frequency a 


if the Fourier coefficient 


Mf — {x(t) exp(—i2nat)) 

(12.16) 

is not zero, 1 in which case, if 


s(t) — acos(2irat + rj) 

(12.17) 

with a 0, 


„ 1 

M s — —a^xp{ir]). 

2 

(12.18) 


As a result, because s(t) is real, its PSD includes a spectral line at frequency^ = a and 
at its image frequency/’ = —a. Thus, the PSD can be expressed mathematically to 
include 

\M“\ 2 [S(f-a)+8(f + a)]. (12.19) 

Such a PSD indicates that the signal exhibits & first-order periodicity. 

As indicated, signals that exhibit a second-order periodicity are of primary interest 
here. Such signals do not manifest spectral lines in the PSD, but they can be processed 
to display a first-order periodicity by a quadratic transformation; as a result, the PSD 
of the transformed signal does display spectral lines. 


1 The notation ( a(t )) designates time average of the function a(t). 
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It is easily demonstrated that the amplitude-modulated signal s(t) = a{t) cos 
{Infot+rj) has no spectral lines in its PSD, but if this signal is transformed by a 
squarer, y(t) =r 2 (^), the resulting PSD does have lines at 0, +f 0 and —f 0 . 

Other first-order signals, such as a binary pulse-amplitude-modulated signal with 
full duty-cycle pulses, once squared, will have a spectral line at f = 0 in their PSDs as 
a result of the squared waveform having a nonzero average power. But there will be 
no spectral line in the PSD at the pulse rate. A quadratic transformation involving a 
delay can produce such a spectral line. In particular, if the transformation 

y(t) — s(t)s(t — r) (12.20) 

is used, there will be lines in the PSD a.tf = l/T 0 , where T 0 is the width of the 
modulated pulses. 

This observation leads to this definition for the second-order periodicity property 
of a signal s(t ): the signal s{t) contains second-order periodicity if and only if the PSD 
of the delay product signal in equation (12.20), for some delay r, contains spectral 
lines at some nonzero frequencies a f 0. It will prove to be more convenient to work 
with the more symmetric delay product 

y x (f) =s(t + r/2)s*(t — t/2). (12.21) 

The complex-conjugation of the second multiplicand is included to allow application 
to complex-valued signals. From this definition, one can obtain the Fourier coefficient 
of the sine wave component with frequency a contained in the delay-product signal 
of equation (12.21). 

i^(r) = (s(t + r/2 )s*(t — t/ 2) exp (—Hit at)). (12.22) 

In the same way that equation (12.16) relates to equation (12.17), this is clearly the 
Fourier coefficient M® x of the sine wave component with frequency a in the delay- 
product signal given in equation (12.21). 

Note that this expression is simply a generalization of the conventional autocorre¬ 
lation function in which a cyclic weighting factor exp(—i2nat) is included prior to 
time averaging. This function is known as the cyclic autocorrelation function. It is also 
related to the delay-Doppler ambiguity function [10] and the Wigner distribution 
function [11]. 

It is noteworthy that can be viewed in three ways: 

• It is the generalization of the conventional autocorrelation function to include 
the cyclic weighting factor exp — Hitat as just described. 


300 Chapter 12 


• It is the Fourier coefficient from the PSD of the delay-product signal y(t) defined 
in equation (12.20). 

• By splitting the exponential into two factors, one can see that it is simply the 
cross-correlation function of the signal y(t) shifted in frequency by +a/2 with 
the same signal shifted in frequency by —a/2. 

This last view arises from the observation that multiplying a signal y{t) by exp {mat) 
to obtain u(t) corresponds to a shift in its spectral content by a/2, leading to a Fourier 
transform given by U(f) = Y(f + a/ 2) and a PSD equal to S u (f) =S y (f + a/ 2). 
Similarly, multiplying y(t) by exp(— mat) to obtain v(t) corresponds to a shift in its 
spectral content by —a/2, leading to a Fourier transform given by V if) = Y(f — a/2) 
and a PSD equal to S u (f) = S y (f — a / 2). Thus, s{t) exhibits second-order periodicity 
when its frequency-shifted versions are correlated with each other. 

It should be clear that if the mean values of u{t) and v{t ), the frequency-shifted 
versions of y(t), are zero, y(t) does not contain sine wave components at frequencies 
dzOf/2. Thus, S y {f) has no spectral line at these frequencies. In this case, the temporal 
cross-covariance K uv ( r) is equal to the cross-correlation R uv {x) — R^{x): 

K uv { r) = ([«(* + r/2) - + r/2))] • [v(t - r/2) - (v(t - r/2))]*) 

= {u(t + r/2)v*(t - r/2)) (12.23) 

This suggests that the temporal cross-covariance K uv { r) might be normalized by the 
geometric mean of the two corresponding covariances evaluated at r = 0, 

iC(0) = R U (0) = (\u(t)\ 2 ) =^(0), (12.24) 

AT„(0) =/?„(0) = (|K?) | 2 ) =^(0), (12.25) 

leading to a definition for the temporal correlation coefficient for frequency translates. 


K'Uvi.'t') 

[/4(0)/4(0)]V2 



( 12 . 26 ) 


The latter half of this equation suggests an appropriate normalization factor for the 
cyclic autocorrelation function, BY, is 1/R y (0). 

Recall the definition of a wide-sense stationary signal. Such signals possess an auto¬ 
correlation R y ( r) that is not identically zero. Following this idea for cyclic signals, 
one says that a stationary signal for which R^ = 0 for all a ^ 0 is purely stationary , 
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and signals with R® 7 ^ 0 for some a /= 0 are cyclostationary. The set of frequencies for 
which o' 7 ^ 0 is called the cycle spectrumr 

These notions of cyclostationary signals just described through their temporal corre¬ 
lation properties can be carried over to the frequency domain. With any signal s(t ), one 
can consider passing it through a narrowband bandpass filter at a frequency / 1 and mea¬ 
suring the power at the filter output. If one extends this idea to a set of such filters with 
contiguous pass bands and lets the bandwidth of these filters approach zero, the corre¬ 
sponding power measurements normalized by the filter bandwidth provide the PSD. 

Similarly, one can pass the two frequency translated signals u{t) and v{t) through 
two sets of narrowband filters and measure the temporal correlation of the outputs. The 
result is the spectral-correlation density (SCD) function. This is seen to be the spectral 
density of correlation in s(t) at frequencies/^ + a /2 and f — a/2. 

Basic signal-processing theory rests on the Wiener relation: the PSD is equal to the 
Fourier transform of the autocorrelation function. In the same way, the SCD is the 
Fourier transform of the cyclic autocorrelation function, 

p-j-00 

S“ (/) = / i^(r) txp[—i2nfr]dr. (12.27) 

J’—OO 


This relationship has become known as the cyclic Wiener relation, from which the 
conventional Wiener relation follows as a special case. It also has been described as 
the cyclic spectral density function. It should be apparent that the SCD is the Fourier 
transform of the cross-correlation function R uv ( r) so that S%(f) = S uv (f). 

Again, a normalization of this cross-spectral density function will be useful, using 
the geometric mean of the corresponding PSDs: 


Suv(0 


<s;(f) 


[S u ({)S v m /2 [Sy (f + a/2)Sy(£ - a/2)] 1 / 2 ^ 


=/W>- 


(12.28) 


7 


Whenever one has \Py(f)\ = 1, one has complete spectral redundancy. 


12.3.2 Algorithmic Approach 

With this background on cyclic signals, it is appropriate to turn to exploiting such 
features for signal estimation using an antenna array. Assume that the environment 
contains a single signal s{t) that manifests the cyclic frequency a; although the 


2 Gardner in IEEE SP Mag, April 1991, provides an AM signal example. 
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environment contains interference and noise, these components are not spectrally 
self-coherent at a, in which case if z(t) — as(t) + n(t), 

i^(r) = W 2 ^(r) +i?f (r) =- \a\ 2 % s (x). (12.29) 

This finding can be interpreted to indicate that the cyclic autocorrelation of the 
received signal is unchanged by the addition of arbitrary interference as long as such 
interference and the signal are not spectrally self-coherent at a cyclic frequency a. 

The goal is to determine an appropriate weight vector w to apply to the array 
output vector z (t), thereby producing a signal y(t) — w^z(f) that consists of 
the one environmental component possessing the specified self-coherence property, 
the SOI. 

The key lies in quadratic transformations of the array element outputs. 

12.3.3 Algorithm Description 


The algorithm described here is based on work by Agee, Schell, and Gardner [2]. 

Recognizing that all array element signals include the SOI, a scalar reference signal 
is formed using a so-called control vector K to weight these array element outputs 

r(t) *(t — r) zxp(i2?cat). (12.30) 


The algorithm is based on finding a weight vector w that can be used to form a beam, 
the output of which matches this reference as closely as possible. This vector is arbi¬ 
trary; any vector that is not orthogonal to the array response vector will work, though 
performance depends on the particular choice. Techniques that optimize performance 
by careful selection of this vector have been described in the literature [12]. 

In this algorithm, a reference signal is formed as a linear combination of the 
array element signals with the coefficients specified by the “control vector” of equa¬ 
tion (12.30). Then the array element signal time series is used to compute a cyclic cross¬ 
correlation with the reference at the frequency and time delay indicated previously in 
accordance with 


zr 


/;«■*<= 


(12.31) 


Because the arriving interference components do not cyclically correlate at this point, 
while the target signal does, the result is an estimate of the array response vector in 
the direction of the SOI. This array response vector is used in conjunction with a 
covariance matrix R formed directly from the array snapshots to obtain a copy weight 
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vector, R ; y = R ~ ] r zr that protects the SOI direction while nulling signals 
arriving from other directions. 

Specifically, a metric F is chosen in accordance with 

F = {\y(t)-r(t) | 2 }, (12.32) 

where (• • •) denotes time averaging. Then, 

F — (|w^ z (^) — r(t) | 2 ). (12.33) 

Multiplying out, the result is 

(w^z(£)z^(f)w — 2Re[w^ z(^)r*(^)] + const] = R^w — 2Re[w^ r zr j + const, 

(12.34) 

where is the conventional array covariance matrix and r zr is a vector represent¬ 
ing the cross-correlation of the reference in equation (12.30) with the array element 
outputs. Taking the gradient and setting the result to zero produces 

w = R; 2 1 r 2r . (12.35) 

This weight vector can then be applied to the array output signal vector in the usual 
way to obtain the desired estimate of the SOI 

(12.36) 

12.3.4 QPSK Example 

As an interesting example, consider a QPSK signal as the SOI and a Gaussian 
amplitude-modulated signal as interferer; receiver noise is also included in the environ¬ 
ment model. The QPSK signal is composed of 2.5 yusec symbols, leading to a signal 
bandwidth of 400 kHz. A four-element linear array receives the two signals, which 
arrive from 20° and —30°, respectively. The rate line associated with the SOI is at 
±400 kHz, corresponding to the signal bandwidth, which is to be used as leverage to 
form a beam directed toward the SOI, with a null in the interferer direction. This rate 
line is observable in the cyclic autocorrelation function of the arriving signal after it 
has been shifted to baseband, as shown in figure 12.6. The minor peak in this func¬ 
tion, at a frequency equal to the inverse of the symbol duration; that is, 400 kHz, and 
a delay equal to the symbol duration; that is, 2.5 yusec, is exploited in the processing; 
the interference and receiver noise waveforms do not manifest peaks for these values 
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Figure 12.6 Cyclic autocorrelation function of arriving baseband QPSK signal 

of frequency and delay, allowing the array response vector to be extracted from the 
element signal snapshot vectors. 

Figure 12.7 shows the beam implemented with this set of weights in the example 
scenario. It is apparent that there is a null in the direction of the interferer (—30°) 
and a beam peak in the protected direction (+20°). Thus, the cyclostationary process¬ 
ing approach provides substantially improved signal-to-interference-plus-noise ratio 
(SINR) for effective data recovery from the SOI. 

Figure 12.8 shows the results of the copy process. In the upper trace, the constant 
modulus of the signal is apparent; in the middle trace, the phase of the copy output 
tracks the encoded values (0, 1, 2, or 3) representing user data, which is shown in the 
lower trace. Because the algorithm described here does not include an absolute phase 
reference determination, there is no way to compute an accounting of symbol errors. 
Such an algorithm could be included as a postprocessing addition. 

Finally, figure 12.9 shows the constellation of the QPSK signal after copy has been 
performed using this technique. It should be clear that accurate demodulation will be 
straightforward with the addition of the phase determination process. 
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Figure 12.7 Shape of the copy beam used for the example QPSK signal 
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Figure 12.8 Copy results obtained for the example QPSK signal 
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Figure 12.9 Signal constellation for the example QPSK copy process 

12.4 Constant Modulus Processing 

12.4.1 Constant Modulus Signal Formats 

Motivated by the high efficiency of power amplifiers when operated at their maximum 
power level, a wide variety of HF communication signals employ a format manifest¬ 
ing an envelope, or signal modulus, that does not change with the modulating data. 
Known widely as constant modulus (CM) signals, important examples include PSK, 
QPSK, FSK, and CDMA; these formats were described earlier in chapter 3. Because 
of their widespread use, array-processing algorithms that exploit this signal feature 
deserve detailed descriptions. This section starts with a detailed description and illus¬ 
tration of the basic algorithm and goes on to describe, in less detail, embellishments 
that enhance capabilities of the basic technique. 

12.4.2 Basic CM Beamformer Description 

The basic CM algorithm was introduced for array processing more than twenty five 
years [6, 7] ago to address two important shortcomings of the least mean square (LMS) 
algorithm: poor performance when the interference is correlated with the signal and 
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sensitivity to array geometry details that limit LMS usefulness when array element 
locations are poorly known. The “CM array” has its roots in blind equalization applied 
to data modems. 

The CM array-processing algorithm works with the complex vector of baseband 
element signals z(t), the components of which have been filtered to limit their band¬ 
width to that of the SOI. In [7] it is assumed that the data samples are synchronous 
with the data bauds so that there is one sample at the peak of each data baud. This 
assumption has been relaxed in more recent CM algorithms. 

Beamforming is carried out in the usual way, with a set of complex weights multi¬ 
plying the array element signals and the outputs summed to produce the copy stream. 
The weights steer the beam to provide gain in the direction of the SOI and deep nulls 
in the interference directions. N — 1 nulls can be generated by an adaptive controller 
which determines appropriate array element weights. The CM array employs an adap¬ 
tive algorithm based on the element signal fluctuations caused by the interference and 
other distortions; this algorithm sequentially runs through the array snapshots, adjust¬ 
ing the weights to eliminate these fluctuations, which lessen as the beamshape adapts 
to the environment in the desired manner. 

The specific approach used here utilizes a cost function equal to the mean square 
of the difference between the beamformer output envelope and the desired unit value 
(after normalization) of the CM signal, | y(t) \? — 1, where p is an integer specifying an 
appropriate norm. A steepest descent algorithm can be used to adjust the array weight 
vector w to minimize this cost function. Specifically for p — 1, this is implemented 
using the error function 

€W = |(|{yW/b'WI}|-l)l- (12.37) 

With the error function defined by equation (12.37), the update-equations are then 

y(ri) = vr H z(n); w (n + 1) —w (n) + /xz(n) c* (n) . (12.38) 

The initial weight vector w(l) can be chosen as an equal weight vector, corresponding 
to a beam pointed in a nominal direction that is in the broadside direction if the array 
is linear; fJL is a constant that controls the rate of convergence and stability of the weight 
determination process. A sequence of data snapshots can then be processed to reach a 
set of weights that produce a CM result. 

This approach can be viewed as a variant of the LMS algorithm in which the desired 
signal is now identified as y(n)/\y(n)\. Because the LMS algorithm suggests premulti¬ 
plication of the input signal vector z(t) by the inverse of the estimated input covariance 
matrix, R^\ such a procedure might be useful in CM processing. Here, such 
premultiplication is viewed as a variant and is not utilized in the example presented. 
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It is noteworthy that this CM technique does not utilize a quadratic cost function 
and thus has no analytic solution for the minimum cost weight vector. In fact, the 
shape of the cost function is complex and has many saddle points. There is some risk 
that the iterative process might converge to an undesirable beam shape. 

12.4.3 CM Algorithm Example 

As an example of the CM array-processing operation, consider a simple example. A 
BPSK signal is transmitted with a 12 MHz carrier and a 1 kHz data rate. This signal 
impinges on a four-element linear array with 12.5 m element spacing (half wavelength 
at 12 MHz) at an arrival angle of 20° from broadside. In addition to this SOI, there is an 
interferer with the same power as the SOI, but with Gaussian amplitude modulation, 
arriving from —30°. Receiver noise that is 20 dB below the SOI is also included. The 
topmost trace of figure 12.10 shows the data-modulated transmitted BPSK signal, and 
the middle trace shows the composite signal amplitude at Element 1 of the array. 

The above algorithm was simulated to generate a 100 bit SOI at RE At each ele¬ 
ment of the array, this signal was down-converted to baseband, low-pass filtered, and 
down-sampled; since the resulting signal was sampled at 10 kHz and the baud rate was 
1 kHz, the SOI was over sampled by a factor of 10. As previously noted, the center 
trace of figure 12.10 shows the resulting signal from Element 1; a vector of such signals 




Symbol number 

Figure 12.10 CM example: 1 kHz data rate SOI with Gaussian AM interferer 
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Figure 12.11 CM example: copy beam pattern 

* - - - / 

serves as input to the CM algorithm. Note that the signal at Element 1 is not CM due 
to the presence of the Gaussian modulated interferer. 

The algorithm described earlier was run with the parameter (i = 0.1. The lower plot 
in figure 12.10 shows the results of the copy output. It is apparent that the CM copy 
algorithm has converged-after about eighty symbols. Figure 12.11 shows the shape of the 
beam formed by the process. This pattern gain is low close to the direction associated 
with the interference (—30°) and the maximum possible gain from a four-element 
array (6 dB) in the direction of the SOI (+20°). For this example, the relative rejection 
of the interference is 15.9 dB. 

12.5 Spectral Difference Processing 

12.5.1 Overview 

This section describes a technique that provides estimates of multiple signal waveforms 
impinging on the HF array. Known as spectral difference processing ,, this technique is 
a so-called blind technique because it does not use any a priori estimate of the signal 
source direction in its calculations. Spectral difference processing is based on a multiple 
filter model exploiting the channelization concepts described in chapter 5. 
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When considering the narrowband signals associated with most HF signaling pro¬ 
tocols, with bandwidths usually less than 5 kHz, the array response vector will certainly 
be invariant across the signal bandwidth. Thus, if the captured data are filtered into 
much narrower frequency cells, with bandwidths that are, say, 10 percent of the typical 
HF signal bandwidth, any array response vector associated with a specific signal will 
be common in all these cells. This feature, which is buried in the collected signal data, 
provides leverage that spectral difference processing exploits. 

In particular, the array response vector associated with the covariance matrix model 
for data covering the entire signal bandwidth can be assumed identical to the array 
response vector in the covariance matrix model for data in the narrower bandwidth 
obtained by filtering the SOI. As shown in the next section, this common array 
response vector can be extracted from two covariance matrices, one generated from 
data covering the full signal bandwidth and the other generated from data at the out¬ 
put of a bandpass filter covering a portion of this full signal bandwidth. The processing 
can be made more robust if several disjoint bandpass filters are used, each covering a 
portion of the SOI bandwidth and therefore producing a different subband covariance 
matrix. Each subband covariance matrix produces a separate array response vector for 
its subband signal; these separate array response vectors can be combined to produce 
a “smoothed" array response vector to use in computing a set of signal copy weights. 
Finally, this set of signal copy weights is used to estimate the signal associated with the 
array response estimate for each signal direction. 

As will become evident in the analytical formulation of the spectral difference algo¬ 
rithm, the solution for the array response vectors associated with the covariance matrix 
model requires that the covariance matrix remains full rank even after the noise covari¬ 
ance is removed. This means that before this model can be employed, it is necessary 
to reduce the dimensionality of the signal data to equal the number of signals present 
in the environment. This dimensionality reduction is based on an eigenanalysis of the 
array covariance matrix, leading to a signal number estimate, N s , and projecting the 
data collected by the N-element array onto the N s principal eigenvectors. 

12.5.2 Analytic Description 

The basic array-processing model described in chapter 6 indicated that the covariance 
matrix of the array outputs is computed from the data matrix Z by 

R= —ZZ H . 

L 


(12.39) 
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Here, if z(t) is a vector of array element outputs at a given sample time, then Z is an 
N x L matrix with the z (t) components as its rows. Thus, the N rows of Z correspond 
to signals at the N array elements, and the L columns correspond to the L array snap¬ 
shots. Approaches to signal number estimation were discussed in chapter 7. Here, it 
will be assumed that a signal number estimate has been made and is available for use 
with the spectral difference-processing algorithm that is the subject of this section. 

The data model for the covariance matrix was also discussed in chapter 7. This 
model indicates that the covariance matrix R is related to the array response vector 
associated with the signals in the environment, as summarized by an array response 
matrix V with the i-th column representing the array response vector for the i-th signal 
and the powers of the signals by a diagonal matrix P^, 

R = VT> s V h + N. (12.40) 

Here N is the expected value of the array data covariance matrix when no signal is 
present; that is, the noise covariance matrix. The eigenanalysis of R, and the thresh¬ 
olding of its eigenvalues, has led to an estimate of the number of signals present in the 
data, N s . In this process the noise covariance matrix was also estimated. 

As a result of this covariance matrix eigenanalysis, the principal eigenvectors, cor¬ 
responding to the N s largest eigenvalues, were estimated to provide a basis for the 
so-called signal subspace. As outlined, the spectral difference algorithm requires that 
the dimensionality of the data be reduced to equal N s . In this subspace, the covari¬ 
ance matrix can be represented by a subspace covariance matrix based on the array 
data projected onto the eigenvectors associated with the largest N s eigenvalues of R 
(principal eigenvectors). Denote this subspace covariance matrix by R^ and the input 
data projected onto the principal eigenvectors by z s . 

As described in the overview, spectral difference processing exploits the assumption, 
usually true in practice for the narrowband signals common in HF communication 
protocols, that the array response V is slowly changing with frequency and therefore 
is unchanged among the frequency cells covering the relatively narrow signal pass- 
band. The algorithm assumes that the full bandwidth N-channel input data Z has 
been filtered to the narrow band associated with the SOI. The covariance matrix r, 
characterizes the full bandwidth environment associated with this narrowband signal 
in the signal subspace, including all potential interferers in this bandwidth. This nar¬ 
rowband data stream is then filtered by at least one set of subband filters to produce 
subband data where at least one subband frequency cell is established. A covariance 
matrix is formed in the subband cell(s); each R^ conforms to the model indicated by 
equation (12.41). The subscript k denotes the particular subband filter used to produce 
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the subband covariance matrix R^; if there are several such filters used to increase spec¬ 
tral difference algorithm robustness, this subscript will indicate the particular subband 
covariance involved. 

Next, the details of this approach are described. First, as a technical detail, the V 
used below should be projected onto the signal subspace. To simplify the notation and 
to clarify the discussion, it ‘will be assumed that the number of signals is equal to the 
number of array elements, making the signal subspace equal to the whole space. Later, 
when copy weights are specified (equation (12.53)), this assumption can be relaxed. 
Starting with the subband covariance 

R* = VP*V" + N„ (12.41) 

note that 

VRW = R- N„ (12.42) 

and 

VP k V H = R* - Nj. (12.43) 

Here, N s is the covariance matrix of the noise in a subband. Then, define 

n k = (R f - N,r 1/2 R*(R, - n \r l/2 (12.44) 

and 

Q=(Rs- N,r 1/2 N,(R, - N,r 1/2 . (12.45) 

With this machinery, the difference of these two covariance matrices can be evaluated 

n k -Q=(R s - N,r 1/2 (R* - N,)(Rf -N s y x ' 2 

= UP*P7 ; U*, (12.46) 


where 


u - (vp/v^r 1 , 2 yv 1 p 


(12.47) 


is a unitary. This reduction of 12.46 exploits the fact that P 5 and are diago¬ 
nal. Note that the unitary U is independent of k so that all filter output covari¬ 
ances with normalization as described in 12.44 are diagonizable by the same unitary 
matrix U. 

The spectral difference algorithm described here can then be summarized as: 


1. Select a subband cell at the array output. 

2. Compute its spatial covariance matrix R/ e . 

3. Normalize it as described by 12.44. 
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4. Compute — Q. 

5. Diagonalize the result with a unitary, which is U. 

6. Compute the array response vectors. 

It is important to note that this approach can be made more robust by noting that the 
same unitary will diagonalize any of the matrices defined by 12.44 for any k. That is, the 
unitary does not depend on the filter selection. Therefore, instead of picking a single 
subband cell for diagonalization, one can compute unitaries, U^, for each subband and 
combine them through a smoothing process such that the diagonalization utilizes all 
cells. The result is a more robust estimate of the array response vectors associated with 
the signals in the environment and consequently, a more robust set of copy weights. 
This smoothing process is covered in more detail in the next section. 

Once this robustification is done, 12.47 can be used to obtain the following estimate 
of the array response vector V: 

V=(R i - N,) 1/2 UP7 ;/2 . (12.48) 

Since P is diagonal by assumption, it has little effect on the result since the entries in 
P just scale the array response vectors in the columns ofV by (complex) constants. 

These estimates of the array response vectors V associated with significantly differ¬ 
ent spectral characteristics can be used to form adaptive beams that copy each signal, 
providing array gain and suppressing the other signals as interference. This is done in 
accordance with the overall R -1 v approach described earlier in chapter 7. Here this 
process is complicated by the projection onto the signal subspace that was done early 
in the spectral difference processing. 

Recall that to simplify notation in the algorithm description, it was assumed that 
the number of signals was equal to the number of sensors. In actuality, this will rarely 
be the case. As a result, the weights found in equation 12.52 will be relevant for signals 
projected into the reduced dimension subspace. In order to apply these weights, the 
snapshot data matrix Z should be projected into this subspace and the weights applied 
to it there. 

z s = EsE? Z, (12.49) 

where Ej is an N x N s matrix of the principal eigenvectors of the covariance matrix R. 

In this reduced dimension subspace, the normalization specified by 12.44 is \}J 2 = 
(Rj — Nf) 1 / 2 . Then, in this subsp ace the desired copy weights become 

W,=V(Z,Zf)- 1 . 


(12.50) 
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But in the subspace, the relationship between the array response vector and the unitary 
found in 12.48 becomes 

z s = (C) 1/2 u. (12.51) 

Note that the irrelevance of the diagonal matrix representing the signal powers has 
allowed V~^ 2 to be assumed equal to the identity, I. 

Then, 12.50 becomes, after some manipulation 

W, = (L,r 1/2 U. (12.52) 

In the original space, the relevant copy weights are determined using the eigenvectors 
defining the signal subspace. That is, 

W = E/W,. (12.53) 

As a result, the signal estimates become 

S = W H Z s . (12.54) 

Before presenting an example, the subband smoothing process will be described in a 
little more detail. 

12.5.3 Subband Smoothing 

Returning to the desire for robustness, recall that the diagonalizing unitary matrix 
U was shown to be independent of the filter used to create a subband. If several fil¬ 
ters are used, each will produce a slightly different unitary due to data differences 
arising from noise, imperfect sampling circuits, and more. It is helpful to combine 
these unitaries because such a combination will produce a single unitary that in 
some sense smooths any subband-to-subband errors. This process is carried out by 
reshaping each unitary to be a vector data snapshot of a stationary process, forming 
a data matrix with these vectors as columns, computing a covariance with this data 
matrix, and finding its principal eigenvector. This principal eigenvector can then be 
reshaped to form the sought-after smoothed unitary. This is in some sense produc¬ 
ing an “average" of these unitaries but with a nonlinear process quite different from 
a simple averaging of components in the set of unitaries obtained from the various 
subbands. 

Specifically, each unitary is vectorized (stretched into a vector generated by stacking 
the columns of a unitary); then each vector is assembled into a pseudo-data matrix D 
with these vectors as columns; next, a pseudo-covariance matrix is formed by DD^; 
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finally, the principal eigenvector of this matrix is obtained and used to represent 
the smoothed unitary U> Uy is then used to determine the requisite copy weights 
through 12.52 and 12.53, with the smoothed unitary Lf used in place of U. 

12.5.4 Example 

To illustrate the capabilities of spectral difference processing, the scenario described 
in chapter 7 is utilized. Recall that this scenario involved a 100 m diameter circular 
array with eight elements at random locations around the circumference. Two 15 MHz 
signals are arriving at the array, one of which is a BPSK signal with element SNR = 
28 dB, while the other is an interfering tone with element SNR — 20 dB. The array 
outputs are complex sampled at 5 kHz, a rate appropriate for signals with bandwidths 
less than 5 kHz according to the Nyquist criterion. Receiver noise is modeled as white 
Gaussian noise with unit variance. 

Signal characteristics. The BPSK signal utilizes a carrier of 800 Hz; eight successive 
samples of this carrier are transmitted with the same phase, either 0 or Jt rad to 
represent the binary information to be transmitted. For purposes of this example, 
the binary information bits are assumed to be-random, with no special structure 
that might be present when a specific transmission protocol is used. As discussed 
in chapter 3, this signal has a bandwidth of 5000/8 = 625 Hz. Its amplitude is 50. 
The second signal is a tone at 1200 Hz with an amplitude of 20. Figure 12.12 shows a 
periodogram of these two signals with receiver noise. The BPSK signal impinges on 
the antenna array from an azimuth of 10° and an elevation of 15°; the tone arrives 
from an azimuth of 80° and an elevation of 20°. 

Filtering with an eight-channel filterbank. After reception and digitization at the 
5 kHz sample rate, the signals at each antenna element are passed through an 
eight-channel filter bank with channels as shown in figure 12.13. This filter bank 
reduces the sample rate by a factor of eight and correspondingly reduces the band¬ 
width at each filter bank output. Figure 12.14 gives a periodogram of the lower 
four outputs from these filter bank ports covering bands 0 — 2500/8 (0 — 312.5), 
2500/8-2500/4 (312.5-625.0), 2500/4-3*2500/8 (625.0-937.5), and 
3 * 2500/8 — 2500/2 (937.5 — 1250.0). The BPSK signal peak is seen in the third 
subplot, and the tone is visible at the upper edge of the fourth. The outputs of all 
eight ports in this filter bank are to be used in the final output, but it is interesting to 
examine a spectral difference result using samples from only one of the eight ports. 
Such results are presented next. 
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Figure 12.12 Periodograms of the two example signals 



Figure 12.13 Channel specification for the eight-channel filter bank 

Copy results with single filter. Suppose a particular output port is chosen to apply 
the spectral difference algorithm. The sampled data at the chosen port is used to 
form a covariance matrix in accordance with equation (12.41); this covariance matrix 
is then utilized in accordance with equation (12.42) through equation (12.47), 
at which point a unitary is available for diagonalization, yielding a set of copy 
weights. These weights are used on the array signal vector data to provide the desired 
estimates of the assumed two signals present. Examination of these results in con¬ 
junction with the periodograms presented in the previous subsection shows that the 
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Figure 12.14 Periodograms of the signals seen at the first four eight-channel filter bank 
ports 
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Figure 12.15 Copy results for spectral difference processing example 
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Table 12.1 Mean square errors after spectral difference copy 


Spectral port 
used for SD 

ME in 

BPSK 

MSE in 

sinusoid 

1 

0.0956 

0.0637 

2 

0.1130 

0.0916 

3 

0.0623 

0.0255 

4 

0.0540 

0.0105 

5 

0.0546 

0.0111 

6 

0.0855 

0.0502 

7 

0.0886 

0.0545 

8 

0.0859 

0.0510 

All 

0.0575 

0.0145 


quality of the signal copy depends on the spectral representation of the signals in 
that band. For example, the fourth filter port provides copy weights most suited to 
the tone not a surprise since the tone is entirely contained in this filter bank port 
data. 

Table 12.1 shows the mean square error in the spectral difference output waveforms 
for each of the two signals used in this algorithm simulation. The first eight rows show 
the results of using the spectral difference copy weights obtained using only the port 
specified. The last row shows the results using the robust copy weights as described 
previously. Although the latter results do not show the robust copy weights to produce 
better results than those produced by any of the individual subband channels, the 
robust weights produce results that are almost as good as those using the best subband 
channel. This suggests that the robust weights will do well and at the same time be less 
susceptible to strong interference in one subband. 

It is appropriate to explore the relationship between the power in the subband 
channels and the mean square errors obtained using the weights derived from the 
corresponding subbands. The upper portion of figure 12.15 shows, for the example pre¬ 
sented here, the power in each of the eight subbands. The lower portion (dark curve ) 
shows the mean square error (MSE) using the spectral difference weights derived from 
the one subband indicated on the abcissa. It is clear that there is a direct relationship 
between the signal power in a subband and the accuracy provided by the spectral differ¬ 
ence copy approach using that channel. This subplot also shows the mean square error 
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Figure 12.16 Copy results using spectral difference processing to recover two spectrally 
overlapping signals 


obtained with the robust approach using all eight subbands. As shown in table 12.1, 
this mean square error is almost as good as that obtained using the best subbands 
alone. 

Copy results using all filters and “smoothing.” Following the approach described 
in the subsection unitary averaging, the eight unitaries computed at the eight output 
ports are combined to form a composite unitary used to generate a set of copy weights 
manifesting some diversity regarding spectral content. The mean square error resulting 
from such processing is shown in the last row of table 12.1. It is clear that the subband 
smoothing approach improves the copy results substantially. Portions of the recovered 
waveforms are illustrated in figure 12.16. 

12.6 On-Off Keyed Signals 

Next, consider another important HF operating environment with both uncorrelated 
receiver noise and various sources of cochannel interference. Assume a new On-Off 
keyed signal s(t ) arrives at the array. The goals are to develop an approach for detecting 
this new signal arrival and for generating an appropriate copy vector to apply to the 
array output N-channel signal matrix. This copy vector should provide effective copy 
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of the new signal with minimal interference from other signals in the environment. 
This section focuses on developing an appropriate copy weight vector for such signals. 

12.6.1 Algorithm Description 

The new signal arriving at the array leads to the complex baseband data vector z (t) 
given by 

z (t) = xs(t) + i(V). (12.55) 

Here, i(^) is a vector signal because the various array elements observe each of the 
interfering signals with steering vectors corresponding to their directions of arrival 
and the receiver noise samples are different element to element. 

Recovery of the signal s(t ) is to be achieved through a complex beamforming vector 
w to provide a single-channel output 

= w // [vi'(^) + i(*)], (12.56) 

where w is computed to maximize the signal-to-interference-plus-noise ratio in ?(/•). 
This quantity is 


w^v 2 (|s(t) 2 ) 

(12.57) 

w^R^w — w^R^w 

(12.58) 

w H Raw 

w H R zz w 

(12.59) 

AjjCj ^ 

~ w w R„w 

= A — 1, 

(12.60) 


where P s = (K^)| 2 ) is the signal power, and R z/ = and R^ = 

are the autocorrelation matrices of interference and received data, respectively. 

If R u is known, or can be estimated from the array data received prior to the new 
signal start, y (w) would be maximized by the eigenvector corresponding to the largest 
eigenvalue of the generalized eigenequation 

R^w = AR#w. ( 12 . 61 ) 

Solution of this eigenequation produces a maximum eigenvalue that is a function of the 
two covariance matrices R^ and R^, X max (K R/v) with a corresponding eigenvector 
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e max (Rzz>Rz'z')> satisfying 

R-zz^max — ^ max^ii ^max • 

Since =yP s v^ + R zz , where P s is the power in the SOI, this becomes 

(vPsV^ T - ~ ^max^ii^max 5 


(12.62) 


(12.63) 


or 


R/f vPgV 77 e m ax — i?^max ty^max — ^ ^max- (12.64) 

Here vP s v^ is a rank -1 matrix, suggesting it has only one nonzero eigenvalue; whiten- 
ing by R 2 1 does not change its rank. But the product of the last two vectors on the 
left side of equation (12.64), e max , is a scalar, leading to the eigenequation 


R zz lyP z = A"e max > (12.65) 

where X" = X' /'v 11 c max . It is straightforward to show that R^'vPj is an eigenvector of 
R2 Vp s v^. But, due to the rank-1 character of this composite matrix, it has only one 
nontrivial eigenvector. Hence 

® max ~ ^5 (12.66) 

where g is a scalar. Thus, v can be estimated as g f Rne ma x> where g' normalizes v to unit 
norm. This result is well known [3]. 

Inserting this eigenvector into the eigenequation (12.64), one obtains 

X max - 1 = PsV^R^ V. (12.67) 

But from equation (12.60), ^ m ax — ^~\- Yopt so that the optimum signal-to- 
interference-plus-noise ratio is then 

Yopt = P s v* R^V (12.68) 


With this machinery, consider the reception of an intermittent signal. If an array cap¬ 
tures the environment while the signal is off, the array data will be z (t) — i(t). But if 
the signal is on, zfy) = \s(t) + ify). Thus, data collected while the intermittent signal 
is off is characterized by R^ = R zz ; data collected when this signal is on is characterized 
by R zz = R„ + Psvv^. 


12.6.2 Intermittent Signal Estimation (Copy) Approach 


Chapter 7 focuses on the detection of signals; for intermittent signals, there is a close 
relationship between detection and copy. If the observation time interval is segmented 
such that the segments are short relative to the time scale of changes in propagation 
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Figure 12.17 OOK signal copy logic and timing 


conditions so that the signal steering vector remains largely unchanged throughout 
each segment, and if an array covariance matrix is computed for each segment, eigen- 
analysis of this covariance will provide data suitable for obtaining a copy weight vector 
corresponding to each segment. 

When the intermittent signal is on, a copy weight vector can be found from equa¬ 
tion (12.66); when it is off, the interference environment can be characterized anew 
by updating the interference covariance matrix R z/ for use in subsequent segments 
until the signal has turned off again. Appropriate threshold comparisons can be 
implemented using the optimum signal-to-interference-plus-noise ratio obtained from 
equation (12.68). 

Figure 12.17 suggests how this might be carried out. 

12.6.3 Algorithm Summary 

This algorithm can be summarized by the following series of steps. The algorithm 
implemented in the example to follow explicitly assumes that the signal is off at the 
start of this process; at least one off interval is needed to provide an initial estimate 
of R n . 

• Divide the time line into a series of short segments suited to the SOI timing. 

• Form the autocorrelation matrix R^ for each segment. 
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• Proceed through the segments, one at a time. 

• For the first segment, in which only interference is assumed to be present, use 
this result for R zz . 

• For all subsequent segments, carry out the generalized eigenanalysis, 
equation (12.61). 

• Compare the largest eigenvalue to a threshold. 

• If the largest eigenvalue is not above the threshold, update R zz using data from 
the current segment. 

• If the largest eigenvalue is above the threshold, a signal has turned on in the 
current segment under test. 

• In this case, use the eigenvector corresponding to this eigenvalue as a copy weight 
vector for this segment. 

• Estimate the power in the segment copy result. 

• If this power is below a second threshold, the SOI has turned off and updating 
R/z can resume. 

• Continue with the next segment. 

12.6.4 OOK Example 

This example is based on the application of the On-Off keyed algorithm just described 
to reception of a Morse-coded CW signal buried in interference. Timing of the coding 
is based on a common Morse rate of 20 words per minute (wpm), which translates 
into approximately 20 dot units per second. This rate implies that the duration of the 
Morse-encoded signal for the word “PARIS,” a standard keying test word, is 2.5 s. 
In the example simulation, this signal arrived at a four-element linear array (half¬ 
wavelength element spacing) from an angle of 20°. Its signal-to-receiver-noise ratio 
(SNR) was 20 dB. 

Also impinging on the array is an interferer with a Gaussian-modulated, bandwidth- 
limited signal that is 20 dB stronger than the SOI. It arrives from an angle of —-30°. 

The algorithm just summarized was operated for the duration of the message, “The 
quick sly fox jumped over the lazy brown dog.” As noted in the algorithm description, 
two thresholds are needed, one for new signal start decisions and a second for a signal- 
end decision. For this simulation, these thresholds were chosen by trial and error. 
However, analytical approaches to threshold settings could be used [3, 12]. 

Figure 12.18 provides a pictorial view of the results. The top panel shows the 
transmitted modulation of the 22 s signal; the middle panel shows the composite signal 
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Figure 12.18 OOK signal copy example results (arbitrary scales) 

and interference arriving at array Element 1. Clearly the Morse-coded signal is diffi¬ 
cult to make out. An audio replay of the signal does not evidence any indication that 
a Morse signal is present. Finally the bottom panel shows the copy output. Although 
there is noticeable variation in the gain of the process throughout the message, the 
timing is well defined, and an audio replay suggests that the Morse message could be 
decoded with few, if any, errors. 

Such a simulation could be used to characterize algorithm performance as a function 
of various system parameters: number of antenna elements, signal SNR, interferer 
level, threshold settings, and more. But such investigations are outside the scope of this 
example, the purpose of which was to demonstrate a signal copy technique designed 
for On-Off keyed signals. 

12.7 Radar Signals 

Most of this text focuses on communication signals, but the adaptive N-channel- 
processing techniques can be applied to radar signals as well. Constant modulus (CM) 
algorithms are especially appropriate because radar transmitters, in their quest to 
maximize target range performance, which improves only as the fourth root of trans¬ 
mitted power, utilize CM signals almost universally. Too much effort goes into the 
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development of a high-power transmitter to waste any of its potential range on the 
niceties of an amplitude-modulated waveform. 

This section addresses how array-processing methods can be utilized for processing 
radar signals, with most detail reserved for the linear FM modulation. As discussed in 
the next subsection, such a radar signal is used to provide needed range performance 
without sacrificing range resolution. 

12.7.1 Radar Pulse Formats 

Unmodulated pulse radar signals typically utilize very short pulse durations in order to 
satisfy range resolution requirements. Since range resolution is directly proportional 
to pulse duration, a signal must be shorter than perhaps 1 /zsec, with a bandwidth 
greater than 1 MHz, if it is to have a useful range resolution specification. Even a 
signal with this duration provides marginal range resolution, having a nominal capa¬ 
bility of 300 m. More useful unmodulated radar signals might (possibly) have an even 
shorter duration of perhaps 0.1 /zsec, a bandwidth equal to 10 MHz, and a nominal 
resolution of 30 m. Such signals could be processed by an HF receiver. However, the 
extreme bandwidth (for HF) will not have predictable propagation support, and in 
addition, phase variations over such a wide HF passband will make coherent processing 
a challenge. 

For these reasons, HF radars typically employ linear FM signals. Bandwidths lim¬ 
ited to tens of kilohertz are used to assure consistent propagation support. Frequently, 
such radars employ an FM sweep to provide sufficient energy on target and rely on the 
sweep bandwidth and coherent processing over extended time periods for adequate 
range resolution and operating range performance. Such extended processing implies 
that Doppler shift must be accommodated as a component of the signal-processing 
suite. 

12.7.2 Linear Frequency Modulation Processing 

With this justification for the emphasis here on processing radar signals with linear 
FM formats, attention turns to the processing itself. 

In contrast to communications signal processing, radar signals do not inherently 
carry unknown data. The waveform itself is known in detail, and its utility is derived 
from observing the delay, Doppler shift, and amplitude of a target return. Because 
the waveform is known, a matched filter can be employed to capture the signal 
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energy associated with a specific reflection time, and a bank of such filters can be 
used to derive Doppler shift. Signal amplitude provides information concerning target 
size. With such matched filters, the energy associated with an extended waveform 
can be compressed into a single pulse with duration commensurate with the signal 
bandwidth. 

A matched filter for a known linear frequency swept signal can be implemented as 
a complex correlation with a replica signal incorporating a test Doppler shift. In the 
correlation process, such a reference waveform is scanned through the incoming signal 
in both range and Doppler in accordance with 

z mf(f> r ) = ^2 *(**)**(**' “ r ) ex P (12.69) 

n 

Such range-Doppler processing can be implemented using a two-dimensional Fourier 
transform in which the cross-correlation with the linear FM sweep is carried out in the 
frequency domain. This approach to processing is used in the example presented next. 

As an example, a number of coastal radars have been deployed to study ocean behav¬ 
ior, and these radars use a linear swept-FM signal covering about 26 kHz of bandwidth, 
providing 6 km range resolution. Coherent processing for several seconds leads to sub- 
Hertz Doppler resolution. Any array-processing technique should exploit the known 
waveform structural features as a prelude to antenna-based processing. Thus, matched 
filters should be employed on each of the antenna channels; then, the outputs of these 
matched filters serve as the starting point for array processing to follow. 

Figure 12.19 provides an overview of this array processing. Discussions here will 
be limited to the direction-finding (DF) function. Assuming the channel-matched 
filters are appropriately matched to each other, or alternatively are well calibrated, the 
correlators will preserve channel-to-channel phase relationships. Then, the matched 
filter outputs can provide snapshots to be used in covariance matrix formulations, as 
with communications signals. It is only necessary to synchronize any snapshots to 
be combined in this way with the radar signal repetition period. This implies that 
the range interval to be processed is divided into range resolution cells and each cell 
processed separately. 

This approach to processing has been rigorously justified by Kelly [4] in a formu¬ 
lation based on a generalized likelihood ratio test (GLRT). The technique described 
there is based on a radar model in which the transmitted signal is known to the receiver. 
Kelly derives an appropriate weight vector to be applied to the signals from each of 
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Figure 12.19 Overview of linear FM radar signal processing with an array 


the multiple antennas assumed. Multiple radar snapshots can be incorporated into the 
processing in a straightforward manner. 

This methodology will be illustrated with an example. A radar with a sweep width 
of 20 kHz and a sweep (pulse) repetition period of 100 ms is assumed, leading to a 
total of two thousand range cells. Successive vectors of correlator bank outputs, at 
time epochs 100 ms apart, provide the snapshots need for covariance formulation. If 
direction finding is the radar goal, the range-Doppler image can be used for target 
detection. Once the detection process identifies the range-Doppler bin associated with 
a target, the complex signal returns from that range-Doppler bin from each antenna 
element can be used to build the covariance matrix needed to produce a MUSIC-based 
DF result. 

Furthermore, if target detection is the goal, the separate antenna streams from all 
correlator output bins can be used to build change matrices for each bin, as described 
chapter 7. 

Finally, if a covariance matrix is constructed as described, it can be used with a 
MUSIC-based steering vector to provide a set of copy weights. If interference is to 
be suppressed, a covariance matrix that contains the interference, and not the SOI, 
would be required. Depending on the environment, such a covariance matrix may be 
computed using snapshots corresponding to nearby target ranges. If the radar is used 
for essentially point targets and the interference is due to extended targets, such an 
approach would be appropriate. Other scenarios will require alternate approaches to 
determining a useful interference covariance matrix. 
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12.7.3 LFM Processing Example 

The radar previously described, with a linear FM sweep of 20 kHz in 100 ms, is used 
as an example to illustrate some of the details. Figure 12.20 shows the phase of one 
sweep of this radar. With such quadratic phase, a linear frequency sweep is generated. 
A Doppler shift of 3 Hz was applied to this radar signal, emulating a moving target 
return. In this example, a linear antenna array with half-wavelength spacing was used to 
receive the radar signal, which was postulated to emanate from an angle of 20°. Three 
successive sweeps were processed by a correlation process that implements a matched 
filter to a single sweep at a specific Doppler velocity; each antenna channel signal was 
processed in the same way. Because three such sweeps were processed through the 
matched filter, the output, shown in figure 12.21 for one Doppler shift/antenna pair, 
indicates three sharp peaks. The timing of the peaks relative to the correlator timing 
reference is a measure of the target range, in this example zero (processor delay causes 
the peak to be at about 10 ms, the sweep duration), while the Doppler channel showing 
the strongest temporal peak provides a measure of the target velocity. In the example 
processor, nine Doppler channels were implemented. 

After range-Doppler processing of all antenna channels, which implements a 
matched filter for all Doppler velocities, the results are examined to determine which 
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Figure 12.20 


Radar waveform phase for the linear FM processing example 




Copy: Feature Exploitation Methods 329 



Time (msec) 


Figure 12.21 Correlation-processing output for linear FM radar signal example 

of the range-Doppler channels have the strongest peaks. Figure 12.23 shows the mag¬ 
nitude of the peaks from all nine channels, over a limited portion of the range sweep, 
indicating that the processor provides the strongest peak in the channel correspond¬ 
ing to 3 Hz, the proper Doppler channel, and at a range corresponding to the sweep 
duration (simulated target at zero range). The same behavior is seen in each antenna 
channel. 

Once the range-Doppler channel with the strongest return has been determined, 
the complex magnitudes of the peaks in this Doppler channel for all antennas are col¬ 
lected to obtain data vectors for computing the requisite covariance matrix; one such 
vector is obtained for each set of radar sweeps used for Doppler processing. Once 
this has been completed, the vectors are combined to form a covariance matrix in the 
usual way: 

R= J2 W Z mf ( 12 - 7 °) 

sweeps 

In this example, three such vectors were used to form the covariance matrix. Then, 
direction finding was implemented using the MUSIC algorithm. The covariance 
matrix was eigenanalyzed, and the eigenvectors associated with the smaller eigenval¬ 
ues were used to specify the noise subspace. Then, the array manifold associated with 
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Figure 12.22 MUSIC-based DF result for linear FM radar signal example 

the array geometry was searched for a steering vector that is orthogonal to this noise 
subspace, as MUSIC prescribes. The result of this process is shown in figure 12.22, 
which shows the MUSIC spectrum, a plot of the inverse distance of all manifold 
steering vectors from the noise subspace. Clearly, there is a strong peak in the MUSIC 
spectrum, suggesting that the processing algorithm described earlier works well. In 
this example, the steering vector on the array manifold and the eigenvector asso¬ 
ciated with the correlation peak-based covariance matrix were 0.0001 beamwidths 
apart. 

12.8 Voice-Modulated Signals 

Voice traffic over HF links is relatively common; many users are drawn to HF because 
of the potentially low-cost, long-distance communications. Voice signals can use one of 
the amplitude modulation approaches discussed in chapter 3; frequency modulation 
provides poor performance because of the very limited bandwidth available in the 
HF bands. However, because of its bandwidth efficiency, single-sideband suppressed- 
carrier (SS-SC) modulation appears to be the most popular format. 

Unfortunately, SS-SC modulation lacks a salient waveform feature that can be 
exploited for array-based signal copy. As a result, voice signal copy is a relatively 
challenging topic for consideration here. Nevertheless, this challenge provides the 
motivation to introduce a relatively new approach to array signal copy by Schell and 
Gardner [9]. This technique, programmable canonical correlation analysis (PCCA), is 
related to those approaches, discussed earlier in this chapter, that exploit a training 
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Figure 12.23 Doppler-processing output for linear FM radar signal example 


signal. But PCCA offers extreme flexibility, generating a pseudo-training signal 
through creative spectral filtering of the array element signals. The filtering effectively 
transforms the arriving signal wave vector into an alternate multidimensional space 
without changing the phase relationship among the signal vector components estab¬ 
lished by the spatial sampling. After this filtering, the unknown relationships among 
the spatial samples; that is, the steering vectors needed for signal copy, can be identified 
by the multivariate analysis technique known as canonical correlation analysis. 

12.8.1 Canonical Correlation Analysis Methodology 

For a thorough grounding on canonical correlation analysis, the reader should consult 
a basic text on multivariate analysis [16]. Only the general principles of this approach 
will be reviewed here. Basically, the method applies to situations in which the analyst is 
presented with two data groups, both relevant to the topic but each providing different 
information, or similar information, in a different form. Each group is characterized by 
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a sequence of observed data vectors; the analyst is interested in determining a common 
relationship among the two vector sets. 

In more abstract terms, identify observations from the first data group with a vector 
xj and observations from the second data group with a vector x 2 . One can repre¬ 
sent these joint observations by a “super-observation” using the vector x. With this 
notation, a sample covariance matrix for the super-observations can be considered. 



This covariance matrix can partitioned in the usual way. 



S;2 

S22 


(12.71) 


(12.72) 


This question of interest can be posed: Considering linear combinations of observa¬ 
tions from the first data group, u — a^x/, and linear combinations of observations 
from the second data group, v — b^X 2 , what is the largest possible correlation? This 
question can be addressed by maximizing the covariance between u and v under the 
constraint that the variances of these variates are unity: var{u) — var{y) — 1. 

The covariance between the elements of xj and x 2 are in the submatrix Sj 2 , while 
the variances and covariances within elements of each data group are contained in the 
sample covariances Sjj and S 22 : 

cov(u , v) = a^Sj^b, (12.73) 

var(u) — a^a, (12.74) 

var(v) — b H S 22 b. (12.75) 

The desired solution can be found using Lagrange multipliers \\ and A 2 to maximize 

V = * H S 12 b - Afya^a- 1) - ki(b H S 22 b - 1). (12.76) 

Differentiating V with respect to the complex quantities a and b and setting the results 
equal to zero produces the equations 

S i2 b-AiS;;a-0, (12.77) 

S 2 ;a - X 2 S 2 2 b = 0. (12.78) 

These two relations can be solved with the aid of the constraints & h Su2l— 1, b^S22 
b = 1, and, assuming b^S 2 /a is real (justified by the assumption that it is maximized) 
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to obtain 

A 2 = b^S^a = $ 12 $ = X\ = K. \ (12.79) 

Here the scalar AT has been introduced since the two Lagrange multipliers are seen to 
be equal to each other and equal to the desired maximum correlation between u and v. 

Returning to the equations specifying the Lagrange multipliers, equation (12.78), 
one obtains 


KS n a = S 12 b , (12.80) 

K$22$ = $2i3- (12.81) 

Solving for a results in 

z=^Sj/S 12 b (12.82) 

K 

so that 

(S2iSY/S 12 - K 2 S 22 )b = 0. (12.83) 

Similarly, one can solve for b to obtain 

($12$ 22 $21 ~ K 2 Su) a = 0. (12.84) 

These relationships show that a is a generalized eigenvector in the relation 

$ 12 $ 22 $ 213- — K 2 $ u a (12.85) 

and that b is a generalized eigenvector in the relation 

$21$ii $ 12 $ = K 2 S22$ • ( 12 . 86 ) 

An alternative pair of eigenequations can be obtained if these two are premultiplied 
by Sj/ and > respectively. 

The resulting pair of ordinary eigenequations becomes 

Sj/S 12 S 22 S 21 & = K 2 a., (12.87) 

SY 2 S 21 Sj/S 12 b = K 2 b. (12.88) 

These eigenrelations will have more than one eigenvalue/eigenvector pair; the number 
of nonzero eigenvalues in each will be N e = rank(Si 2 ) < min(p , q), where p and q are 
the lengths of the vectors x; and x 2 . Because the eigenvalues will be distinct, a single 
pair of the variables u and v will not be adequate to quantify the relationship between 
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the two sets x/ and x^. Rather, N e pairs will be needed as specified by the inner products 

Ui — a^x/, (12.89) 

Vi = bfx2 (i = l, (12.90) 


The result of these manipulations is that the data has been transformed from the orig¬ 
inal two group coordinates x to new coordinates y=[n v]. In the y coordinates, one 
can show that the sample covariance matrix has the form 



D 

I 


(12.91) 


where I is an identity matrix of size N e , and D is a diagonal matrix of D eigenval¬ 
ues. Thus, the transformation produces normalized within-group variates as well as 
uncorrelated variates between groups. These diagonalization techniques become the 
foundation for the PCCA approach to signal estimation that will be described in the 
next section. 


12.8.2 Programmable Canonical Correlation Analysis 

The mathematical machinery described in the previous section provides an approach 
to signal estimation from array data that can be adapted to a wide variety of signal types 
by designing an appropriate transformation [9]. This section focuses on the application 
of canonical correlation analysis to array data processing; in the section to follow, the 
application of the technique to an SS-SC signal format will be illustrated by example. 

PCCA can be summarized by a block diagram of the form shown in figure 12.24. 
The vector of array data z is linearly processed by two parallel paths. The first of these is 
the normal signal copy channel, in which the signals at the array elements are weighted 
by an appropriate set of copy weights w s in the normal manner. The second path 
employs a transformation, or filter, to each channel of the array data before imple¬ 
menting a similar set of copy weights w> to the resultant filter output vector. Here the 
subscript s has been used to designate the “signal” channels while the subscript t is 
used to designate the “transformed” channels. 

The goal is to find the two sets of copy weights that minimize the difference between 
the two copy outputs. If the transformation (filter) is chosen such that the transformed 
path signals emphasize the characteristics of the SOI relative to those of the interfer¬ 
ence, the weights on the signal path will de-emphasize the interference and provide a 
low-interference version of the array data at the signal copy output port. 
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Figure 12.24 Conceptualized PCCA processing 


This approach can be formulated in a way that is parallel to the canonical correla¬ 
tion analysis method by defining a transformed copy signal t on the transformation 
(filter) path, noting that the comparable copy signal on the antenna data path is the 
desired signal estimate s. The goal is to find appropriate copy weights that minimize 
any discrepancy between the two signals. Mathematically, this can be described as 



( 12 . 92 ) 


subject to the constraint that — I and similarly for W^. 

The applicability of the mathematical approach described in the previous section 
should be apparent. Thus, the desired pair of spatial filters can be ascertained as the 
dominant eigenvectors from the two eigenequations 


~ —7 ^ —7 ~ H 

^zz ^zy^z,i ~ 


= AjWyj, 


(12.93) 

(12.94) 


where the dominant L eigenvalues are utilized. In practice, only the first of these 
equations need be solved since the transformed copy signal is not required. 


12.8.3 SS-SC Signal Example 


As an example of PCCA processing for voice-modulated signals, a synthetic voice sig¬ 
nal was generated based on Gaussian random samples for both amplitude and pitch. 
Pitch was centered at 200 Hz but varied with a variance of 20 Hz about this center 
frequency. Figure 12.25 shows the overall characteristics of this synthetic voice signal; 
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Voice modulation 



Figure 12.25 Characteristics of synthetic voice modulation for SS-SC signal 



2000 2050 2100 2150 2200 

Sample number 

Figure 12.26 Samples of the synthetic voice modulation for SS-SC signal 
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Figure 12.27 Baseband signal copy results after PCCA processing: a) transmitted base¬ 
band signal of interest, b) channel 1 PCCA baseband output, c) channel 2 PCCA baseband 
output, d) channel 3 PCCA baseband output, e) channel 4 PCCA baseband output 

figure 12.26 shows some samples of the baseband waveform at a random point in the 
complete waveform. The baseband waveform was then used to amplitude modulate a 
carrier and the result filtered to produce the SS-SC signal for inclusion in the signal 
environment as the SOL This signal was simulated to arrive at the listening antenna 
array from an azimuth of 20°. 

A second signal, the signal-not-of-interest (SNOI) with Gaussian noise amplitude 
modulation, and at the same level, was generated and simulated to arrive from an 
azimuth of —30°. Both signals were at approximately the same power level, 30 dB 
above the simulated noise. The listening array consisted of four antenna elements with 
half-wavelength spacing at an RF frequency of 12 MHz. This geometry and frequency 
put the two signals a beamwidth apart. 

PCCA processing, as described in the previous two sections, is carried out on 
the four-element array signals. The array element signals are narrowband filtered to 
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eliminate out-of-band signals from the processing. Then the resulting signals are passed 
through a filter that affects the two signal types differently. In this example, the filter 
was identical to the SS-SC filter used to eliminate the carrier and lower sideband from 
the amplitude-modulated signal generated as part of the SS-SC signal generation pro¬ 
cess. Thus, the SOI was unaffected by the filtering, while the noise-modulated signal 
was significantly impacted. Then, the statistics of the antenna signals and the filtered 
signals were used to compute appropriate copy weights. 

Recall that several copy weight vectors are generated; the number of vectors is 
equal to the number of antenna elements. PCCA processing must be supplemented by 
postcopy processing to determine which set of copy weights produces the SOI. Such 
processing has not been investigated in detail for this example, but the proper SOI 
channel will be apparent from the results. 

Figure 12.27 shows the baseband signals obtained by demodulating the RF signals. 
It should be obvious that the process is working well. In the simulation, the Matlab 
“soundsc” command was used to produce audio of the transmitted signal as well as the 
signals in each of the copy channels. The listener’s ears can confirm the effectiveness 
of PCCA processing in recovering the SOI. 
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